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full  order  singularly  perturbed  game,  while  solving  the  reduced  order  ones, 
reduced  order  near  optimal  strategies  are  obtained  and  well-posedness  is 
shown  for  two  classes  of  Leader-Follower  games.  Decision-dependent  information 
structure  is  employed  in  both  Nash  games  and  optimal  coordination  problems 
and  two  market  models  of  duopoly  with  this  type  of  information  structure  are 
extensively  analyzed  and  examined.  Finally,  sufficient  conditions  for 
existence  of  the  solutions  of  both  the  stochastic  optimal  coordination  problem 
with  decision-dependent  Information  structure  and  the  stochastic  Leader- 
Follower  team  problem  with  partial  decision-dependent  information  structures 
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CHAPTER  1 


INTRODUCTION 


I. 1.  Motivation  and  Background 

In  a  world  complicated  by  competing  multi-national  corporations  and 
vast  informational  data  bases  stored  in  high-speed  computers,  the  need  for 
optimizing  large  scale  economic  systems  has  gained  considerably  in  importance. 

It  is  thus  no  surprise  that  hierarchical  engineering  solutions  to  problems  of 
this  nature  have  attracted  a  great  deal  of  attention  from  researchers  in  the 
past  several  years.  In  particular,  the  theory  of  games  provides  formalization 
of  many  basic  problems  in  large  scale  systems,  i.e.,  problems  characterized 
as  having  several  decision  makers  acting  on  different  sets  of  information  with 
possibly  conflicting  goals. 

The  appearance  of  the  "Theory  of  Games  and  Economic  Behavior"  by 

J.  V.  Neumann  and  0.  Morgenstern  [1]  gave  the  Impetus  for  research  in  game 
theory.  Although  the  importance  of  the  theory  of  games  was  initially 
recognized  almost  exclusively  in  economics,  its  usefulness  and  recognition  as 

a  challenging  area  of  research  is  established  today  in  mathematics,  engineering, 
economics,  sociology,  and  political  science. 

A  basic  feature  of  the  theory  is  that  in  solving  a  game  theoretic 
problem,  one  is  faced  not  with  a  conventional  minimization  (maximization) 
problem,  but  with  a  conceptually  different  situation  altogether.  This  might 
be  expected,  however,  since  the  outcome  of  a  decision  maker  depends  not  only 
on  his  own  actions  and  chance,  but  also  on  complicated  interactions  with 
other  decision  makers.  The  theory  of  games  can  be  viewed  as  a  generalization 
of  centralized  and  decentralized  control  problems,  since  most  of  the  questions 
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poaed  in  control  theory  can  be  considered  in  a  game  theoretic  framework,  but 
their  solution  is  usually  more  difficult.  The  generality  and  unification  of 
several  special  cases  which  characterize  game  theoretic  results,  help  to 
counterbalance  the  difficulties  encountered  in  finding  them. 

Game  theory  reduces  the  ingredients  in  a  formalization  of  the  game 
problem  to  seven  essential  elements. 

1.  The  players  involved  in  the  game  (there  may  be  only  two,  or  possible  more 
players) . 

2.  A  description  of  the  interaction  both  among  the  players  and  the  players 
with  the  system.  (In  a  dynamic  setting,  a  difference  or  differential 
equation  describes  this  situation.) 

3.  The  information  structure  of  each  player,  describing  an  "information  space" 
which  contains  the  precise  information  gained  or  recalled  by  the  player  at 
every  stage  of  the  game. 

4.  The  decision  space  (of  alternative  course  of  action)  for  each  player. 

5.  The  admissible  strategies  of  each  player,  defined  as  mappings  from  the 
information  space  into  the  decision  space. 

6.  The  objective  function  of  each  player. 

7.  A  rational  equilibrium  solution  concept  which  takes  into  account  the 
relative  power  of  each  player  and  the  hierarchical  structure  of  the 
decision  process. 

Two  solution  concepts  which  are  of  particular  interest  here  are 
the  so-called  Nash  and  Leader-Follower  (LF)  games,  (see  [30], [3]).  A 
general  description  of  Nash  and  LF  games  raav  be  given  as  follows.  Let  7^  and 
7?  be  the  spaces  of  admissible  strategies  for  player  one  and  player  two. 
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respectively,  with  y1s  and  Y2e  r2‘  Let  J1^Y1,Y2^  and  J2^Y1’Y2^  be  the 
corresponding  cost  functions  of  the  two  players.  A  Nash  equilibrium  solution 
is  defined  as  follows: 

Definition  of  a  Nash  equilibrium:  (y°»Y2)  is  an  equilibrium  Nash  solution  if 
and  only  if 

Jl(Y°tY2)  £  Jl^Yi,Y2^ 

and  (1.1) 

<J2(y°.y2)  £  J2Cy°.y2) 

so  a  Nash  equilibrium  solution  assumes  that  if  one  player  minimizes  on  the 

basis  that  the  other  player's  strategy  is  known  and  it  is  at  equilibrium 

then  the  first  player  will  find  his  optimal  strategy  at  equilibrium. 

To  define  the  LF  equilibrium,  we  need  the  following  definition:  The 

rational  reaction  set  of  the  follower  (player  two)  to  the  permissible  strategies 

2 

of  the  leader  (player  one) ,  D  (y^)  is 

?  .  * 

D  (y x)  *1y2€  F,  such  that  J2 ^Y2 ,Yl' -  J2 ^Y1 ,Y2^  for  all 

Y2€  and  each  y^K 

Definition  of  the  LF  equilibrium:  A  permissible  pair  of  strategies 
2 

(y^S  y2^D  ( y 2_) )  *-s  said  to  be  in  global  LF  equilibrium,  with  player  one 
as  a  leader,  if 

J1^Y1,Y2^  ~ 

for  all  pairs  „ 

{yxs  r1,  y2€  d  (Yj_) }  • 

Nash  game  describes  a  situation  of  conflict  where  the  two  players  do  not 
trust  each  other  and  do  not  cooperate,  but  each  player  assumes  that  the  other 
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one  will  act  in  a  rational  way.  The  two  players  are  assumed  to  declare,  but 
not  necessarily  apply,  their  strategies  simultaneously,  and  both  players  know 
r  ,  r2*  Jl’  and  J2‘  The  8ame  forraalizes  situation  where  the  follower 
tries  to  minimize  for  a  given  choice  of  by  the  leader.  The  leader, 

who  wants  to  minimize  J^,  knowing  the  rational  reaction  of  the  follower  and 
having  the  power  to  declare  his  strategy  first,  wishes  to  announce  a  strategy 
y*  which  achieves  the  minimum  possible  J^.  The  leader  must  know  T^,  J^, 

and  J2,  but  the  follower  knows  the  strategy  y*  (not  the  strategy  value),  "2 
and  J2- 

The  LF  solution  concept  was  first  introduced  by  von  Stackelberg  [3] 
within  the  context  of  economic  competition.  It  was  generalized  to  the  dynamic 
game  case  by  Chen,  Cruz,  and  Simaan  ( [4] ,  (5] ,  [6])  .  There  are  different  types 
of  LF  strategies,  (a)  open-loop  strategies,  (b)  feedback  strategies,  and 
(c)  closed-loop  strategies.  References  [5]  and  [6]  provide  discussions  of 
these  various  types;  however,  we  will  confine  our  attention  primarily  to  the 
closed-loop  type.  It  was  thought  for  a  long  time  that  the  solution  of  closed- 
loop  strategies  would  be  impossible  to  obtain,  but  recently  several  successful 
attempts  have  been  made  in  this  direction,  see  [16],  [18],  and  [26].  In  [18] 
and  [26]  a  new  and  important  class  of  LF  games  of  the  closed-loop  type  has 
been  defined  and  developed.  In  this  class  the  leader  achieves  his  global 
optimal  payoff.  In  other  words,  the  leader  is  able  to  induce  the  follower  to 
play  with  him  as  a  team,  even  though  the  follower  optimizes  his  objective 
function. 

The  information  gained  or  recalled  by  each  decision  maker (DM) at  every 
stage  of  the  game  (the  information  structure),  is  crucial  to  the  solution  of 
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the  game.  For  a  given  Information  structure,  each  DM  tries  to  find  his 
optimal  strategy  (where  optimality  is  defined  according  to  the  solution  concept 
adopted  by  the  DM) .  A  different  information  structure  will  generally  require 
a  different  optimal  strategy,  and  hence  may  result  in  a  different  payoff. 

This  leads  to  a  natural  way  for  comparing  two  information  structures  in 
terms  of  the  maximum  payoff  that  can  be  achieved  through  their  use.  For  a 
detailed  discussion  of  information  structure  and  its  value,  the  reader  is 
referred  to  [44]  and  [48].  In  problems  involving  a  single  decision  maker,  the 
more  Information  the  DM  has,  the  better  off  he  is;  but,  for  problems  with 
many  decision  makers  this  is  not  generally  true,  as  was  shown  in  [54,55,57] 
within  the  context  of  Nash  games. 


1.2.  Contribution  and  Outline  of  the  Thesis 

In  this  section  we  will  outline  the  results  of  our  work,  relate 
them  to  other  existing  ones,  and  point  out  our  contributions. 

In  this  thesis  we  consider  two  important  aspects  of  information 
structures  in  Nash  and  LF  games.  These  two  aspects  are  preservation  of 
information  structure  and  decision-dependent  information  structure.  By 
preserving  the  information  structure  of  the  full  order  game,  while  solving  the 
reduced  order  ones,  we  obtain  reduced  order  solutions  which  are  equivalent  to 
the  full  order  ones.  But  by  using  decision-dependent  information  structure, 
we  obtain  solutions  which  are  usually  different  from  but  more  desirable  than 
the  solutions  obtained  by  using  normal  information  structures;  in  particular, 
in  LF  games,  by  using  decision  dependent  information  structure,  the  leader 


can,  under  certain  conditions,  achieve  any  feasible  solution  he  desires  for 
the  game. 

In  Part  One  of  the  thesis,  we  show  that  by  preserving  the  informa¬ 
tion  structure  of  the  full -order  singularly-perturbed  LF  games,  while  solving 
the  reduced-order  ones,  reduced-order  and  near-optimal  strategies  are  obtained 
and  well-posedness  is  achieved. 

Singular  perturbation  technique  is  used  to  decrease  the  order  of 
the  system,  and  hence  reduce  computation  and  alleviate  the  numerical 
"stiffness"  in  the  problem.  Alternatively  we  can  regard  the  model  reduction 
as  a  simplification  in  desiring  to  obtain  approximate  strategies  which  are 
asymptotically  optimal  but  which  involve  significant  reduction  in  computation. 

A  fundamental  question  is  whether  the  resulting  reduced  optimization  function 
is,  in  the  limit,  equal  to  the  full  order  optimization  function  for  each 
player  as  y  tends  to  zero.  In  linear  quadratic  control  problems  the  reduced 
and  the  full  order  optimization  are  equal  in  the  limit  as  u  tends  to  zero,  i.e. 
the  usual  singular  perturbation  procedure  is  well-posed  [9],  However,  in  games 
this  is  not  generally  true,  as  was  shown  by  a  counterexample  in  [10].  When 
the  usual  singular  perturbation  procedure  leads  to  an  ill-posed  solution,  it  is 
desirable  to  seek  a  modified  procedure  which  is  well-posed.  Such  well-posed 
order  reduction  have  been  obtained  for  linear  quadratic  Nash  games  [10]. 

Cases  for  which  the  usual  singular  perturbation  leads  to  a  well-posed  solution 
have  been  reported  in  [22]. 

In  Chapter  2  the  well-posedness  of  linear  closed-loop  LF  strategy 
is  considered.  When  the  space  of  closed-loop  LF  strategies  is  constrained  to 
be  a  linear  function  of  the  state  variables,  it  was  found  [8]  that  such 
linear  strategies  do  not  exist  because  some  gain  matrices  depend  on  the 
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initial  conditions.  But  by  assuming  that  the  initial  conditions  are 
randomly  distributed  and  averaging  the  performance  indices  over  these 
initial  conditions,  linear  closed-loop  LF  strategies  were  obtained.  In  [11] 
a  linear  closed- loop  LF  strategy  as  described  in  [8],  was  considered,  and  it 
was  shown  that  if  we  restrict  the  space  of  strategies  to  be  taken  from  the 
slow  variable  only,  we  obtain  a  well-posed  formulation.  In  [56]  and  in  this 
chapter,  we  consider  the  linear  closed-loop  LF  strategy  when  both  the  slow  and 
the  fast  are  available  for  measurement.  This  information  structure  is  different 
from  the  one  in  [11].  We  introduce  a  method  by  which  we  can  find  strategies 
using  reduced  order  systems  such  that  if  we  apply  these  strategies  to  the 
full  order  system,  the  resulting  cost  functions  will  have  the  same  limits  as 
the  cost  function  for  the  same  full  order  systems  if  the  full  order  optimal 
strategies  are  applied.  Preserving  the  information  structure  of  the  full  order 
problem  is  the  basic  feature  of  our  procedure.  In  Chapter  3  we  consider  team 
LF  games  for  singularly  perturbed  systems.  We  design  a  well-posed  method  to 
obtain  reduced  order  near  optimal  strategies.  In  this  method,  we  solve  two 
subgames,  one  for  the  fast  modes  and  the  other  for  hybrid  slow  modes  (hybrid 
because  the  fast  gain  is  imbedded),  under  the  constraint  that  the  information 
structure  of  the  full  order  problem  is  preserved.  We  also  show  that  the 
sufficient  conditions  for  existence  of  a  team  LF  solution  for  the  reduced  order 
games  is  equivalent  to  those  of  the  full  order  one  in  the  limit  as  y  tends  to 
zero . 

In  Part  Two  of  this  thesis,  decision-dependent  information 
structure  (DDIS)  is  employed  in  some  classes  of  Nash  and  LF  games.  We  intro¬ 
duce  and  analyze  two  new  models  of  duopoly  with  this  type  of  DDIS  and  we  give 
sufficient  conditions  for  the  existence  of  the  solution  of  two  classes  of 


stochastic  team-LF  games  with  DDIS.  In  Chapter  4  we  consider  Nash  games  with 
DDIS  and  introduce  a  market  model  of  duopoly.  Although  the  concept  of  DDIS 
in  Nash  games  is  not  new  (see  [38]  for  example)  our  analysis  and  approach 
are  different.  In  Section  4.2  we  formally  define  the  equilibrium  Nash 
solution  with  the  two  types  of  information  structure  and  we  give  two  examples 
which  clarify  the  basic  ideas  in  this  section.  In  Section  4.3  we  consider  a 
general  static  market  model  of  duopoly  and  derive  the  necessary  conditions  for 
the  supply  adjustment  controls  of  both  firms  to  be  optimal  in  the  Nash  sense 
for  the  two  types  of  information  structure.  Then  we  analyze,  in  detail,  the 
special  case  of  a  linear  market  demand  relation  and  quadratic  cost  function. 

In  Section  4.4  we  generalize  the  concept  of  DDIS  to  multistage  dynamic  games, 
and  we  give  sufficient  conditions  for  existence  of  the  Nash  equilibrium 
solution  with  DDIS  for  the  discrete  linear  quadratic  problem. 

In  Chapter  5  we  consider  DDIS  as  the  incentive  mechanism  (we  refer 
the  interested  reader  to  [40] , [41] , [42]  for  discussions  on  the  incentive 
problem) ,  which  is  used  by  the  leader  to  induce  the  Nash  followers,  in  a  LF 
game,  to  behave  as  members  of  a  team  with  the  leader's  objective  as  the  objec¬ 
tive  of  the  team,  and  develop  a  static  market  model  of  duopoly  with  the 
government  as  the  market  coordinator.  In  Section  5.2,  the  incentive  problem  is 
formalized  as  (n+1) -person  LF  game  with  one  leader  and  n-Nash  followers.  The 
leader  desires  to  force  the  followers  to  optimize  his  (the  leader's)  objective 
function,  even  though  each  one  optimizes  his  own  objective  function.  In 
Section  5.3  the  incentive  mechanism  of  the  organization  is  formulated  by 
incorporating  the  decisions  of  the  followers  in  the  strategy  of  the  leader. 

By  employing  such  forms  of  strategies,  the  leader  can  force  the  followers  to 
behave  as  members  of  a  team,  with  their  composite  objective  function  contained 


in  the  objective  function  of  the  team.  In  Section  5.4,  we  consider  a  general 
static  market  model  of  duopoly  where  the  government  interfers  in  the  market. 

We  show  that  the  government  can  always  succeed  in  enforcing  the  two  duopolists 
to  cooperate  and  achieve  the  Pareto-optimal  solution.  Then  we  analyze  in 
detail  the  case  of  a  linear  market  demand  relation  and  a  quadratic  cost 
function.  We  obtain  analytic  solutions  for  the  optimal  strategies  of  the  two 
Hash  duopolists. and  the  government.  We  show  that  in  the  limit  as  the  unit  cost 
of  the  govenment  control  tends  to  infinity,  the  enforced  cooperative  optimal 
controls  and  profits  tend  to  the  voluntary  cooperative  ones.  Finally,  we 
discuss  the  general  properties  in  terms  of  marginal  cost,  price,  and  the 
consumers'  welfare  in  the  context  of  this  problem.  In  the  last  chapter  we 
deal  with  two  stochastic  static  LF  problems,  where  the  leader  can,  by  using 
DDIS,  achieve  the  team  solution.  Each  player  has  a  quadratic  cost  function 
and  part  of  his  information  is  a  linear  function  of  Gaussian  random  variables. 

Answers  to  many  problems  in  the  area  of  stochastic  control  with 
classical  information  structure  (in  classical  information  structure,  all 
actions  taken  at  the  same  time  are  based  on  the  same  information,  and  any 
information  available  at  time  t  will  still  be  available  at  time  t'  >  t)  are 
known;  in  particular,  the  problem  of  linear  quadratic  Gaussian  (LQG)  is 
completely  solved.  But,  stochastic  control  problems  with  nonclass ical  informa¬ 
tion  structure  are  more  difficult.  These  problems  are  usually  viewed  in  the 
context  of  team  theory.  The  most  important  theorem  in  team  theory  is 
Radner's  theorem  on  teams  with  static  information  structure  (see  [44]  for  the 
statement  of  the  theorem).  By  using  the  concept  of  nesting,  this  theorem  was 
extended,  in  [43],  to  solve  problems  of  teams  with  dynamic  information  structure. 
Later,  team  decision  making  problems  were  generalized  to  stochastic  game 
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theoretic  framework.  In  [49]  a  two  person  stochastic  Nash  game  with  static 
information  structure  was  considered,  and  it  was  shown  that  under  some 
sufficient  conditions  the  linear  quadratic  Gaussian  Nash  game  admits  a  unique 
equilibrium  solution  which  is  linear  in  the  observation  of  each  decision  maker. 
In  [52]  a  linear  quadratic  Gaussian  two  person  LF  team  game  with  decision- 
dependent  information  structure  was  considered  and  completely  investigated. 

In  the  first  section  of  Chapter  6  we  consider  the  problem  of  a  3- 
person  stochastic  optimal  coordination,  where  the  coordinator  desires  to  induce 
the  two  noncooperative  (in  Nash  sense)  players  to  minimize  his  (the  leader) 
cost  function,  even  though  each  player  minimizes  his  own  cost  function.  The 
coordinator's  cost  function  is  a  convex  combination  of  the  cost  functions  of 
the  Nash  players.  The  information  structure  of  the  game  is  dynamic  and  nested 
since  the  coordinator  knows  whatever  the  fohlower  knows,  and  his  (the 
coordinator's)  strategy  depends  on  the  decisions  of  the  followers.  In  the 
second  section,  we  investigate  a  2-person  LF  game  in  which  the  leader  does  not 
completely  detect  the  decision  variable  of  the  follower.  The  satisfaction  of 
the  condition  of  complete  detectability  of  the  action  of  the  follower  is 
necessary  for  the  leader  to  be  able  to  obtain  his  global  optimal  solution.  The 
case  in  which  the  leader  does  not  completely  detect  the  action  variable  of 
the  follower  was  investigated  in  a  deterministic  setting  by  Basar  [47],  He 
gave  a  general  procedure  by  which  the  leader  can  achieve  a  new  tight  lower 
bound  for  his  objective  function.  In  this  section  we  solve  the  same  problem, 
but  in  a  stochastic  setting,  and  using  a  different  procedure.  We  define  a  new 
modified  team  problem  after  taking  into  account  the  optimal  response  of  the 
undetected  action  of  the  follower.  We  find  that  the  leader  can.  under  a 
certain  condition,  achieve  this  new  tight  lower  bound. 
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Finally,  In  Chapter  7  we  state  some  conclusions  and  we  outline 

directions  for  future  research. 

In  summary,  the  basic  contributions  of  this  thesis  are 

1.  Introduction  of  two  well-posed  procedures  to  obtain  reduced  order  and 
near  optimal  strategies  for  two  classes  of  singularly  perturbed  LF  games. 
The  key  feature  of  these  two  procedures  is  the  preservation  of  the  infor¬ 
mation  structure  of  the  full  order  game,  while  solving  the  reduced  order 
ones . 

2.  Introduction  of  two  new  market  models  of  duopoly  with  DDIS  and  a  detailed 
analysis  of  these  models. 

3.  Determination  of  sufficient  conditions  for  the  existence  of  solutions  for 
both  the  stochastic  optimal  coordination  problem  with  DDIS  and  the 
stochastic  LF  team  problem  with  partial  decision-dependent  information 


structure. 
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part  one 

INFORMATION  STRUCTURE  IN  SINGULARLY  PERTURBED  GAMES 
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CHAPTER  2 

WELL-POSEDNESS  OF  LINEAR  CLOSED-LOOP  LEADER-FOLLOWER  STRATEGIES 

2.1.  Introduction 

In  this  chapter,  we  describe  a  well-posed  procedure  to  obtain 
reduced  order  and  near  optimal  strategies  for  singularly  perturbed  linear 
closed  LF  games.  In  Section  2.2,  we  formulate  the  problem  and  give  the 
necessary  conditions  for  existence  of  a  linear  closed  loop  LF  equilibrium 
solution  for  the  full  order  system.  In  Section  2.3,  the  follower  minimizes 
the  fast  part  of  his  objective  function  under  the  condition  that  the  fast 
subsystem  and  the  fast  part  of  the  leader's  strategy  are  given.  In  Section 
2.4,  Che  follower  minimizes  his  slow  part  of  his  objective  function  under  the 
conditions  that  the  slow  subsystem  is  given  and  that  the  follower's  and 
leader's  strategies  have  the  same  information  structure  as  their,  corresponding 
strategies  in  the  full  order  game.  In  Section  2.5,  the  leader  minimizes  his 
slew  part  of  his  objective  function  under  the  conditions  that  the  slow  sub¬ 
system  is  given  and  that  the  follower  applied  the  above  procedure  to  find  his 
rational  reaction.  In  Section  2.6,  we  find  that  if  we  apply  the  reduced  order 
optimal  gains  to  the  full  order  system,  the  resulting  strategies  and  cost 
functions  will  have  the  same  limits  as  the  strategies  and  the  cost  functions 
for  the  same  full  order  system  if  the  full  order  optimal  gains  are  applied. 

2.2.  Formulation  of  the  Problem 

Let  us  consider  the  singularly  perturbed  system: 


1*  +  A12z 

+  311U1 

+ 

3nu:’ 

(0)  -x 

0 

Is  *  A22z 

+  32l'Ji 

+ 

o  '-'•j  > 

c  (0)  » 

where  n  R  ‘ ,  z  c  R  “ ;  u.  ~  R  and  -  is  a  snail  positive  parameter.  Assume 
that  the  cost  function  associated  with  player  i  Ls 
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Jt  -  e [% J  (y'Qiy  +uiRiiui+ujRijuj'dt:1 


where 


yo  0 


il 

Qi2 

'h 

*i3 

r , i }  .  are  symmetric,  positive  definite  matrices,  t(yo)m0;  E(yoyQ)“I 

where  I  is  the  identity  matrix. 

Let  player  2  be  the  leader  and  player  1  be  the  follower.  A 

closed-loop  linear  Stackelberg  strategy  was  considered  by  Medanic  [8].  In 
his  paper  the  controls  were  assumed  to  be  of  the  form 

U1  ”  ‘Fly  »  u2  *  “F2y ’ 

and  F, ,  the  gain  of  the  leader  is  found  by  solving  the  following  equations. 

(2.1a) 


A '  cMl  +  MlAc  +  M1S 11M1 +  F2R12F2  +  Q1  *  ° 


AcM2  +M2Ac  +  M1S21^1  +  F2R22F2  +  ^2  *  ° 

NlAc+AcNl  ‘SUM2N2  'N2M2Sll  +  S21MlN2  +N2M1S21  “° 


Vc+AcN2  +  I  *° 


R12F2N1  +  R22F2N2  “  32  (M1N1  +  M2N2)  =° 


(2.1b) 

(2.1c) 

(2. Id) 
(2 . le) 


w  n  £  t  s 


_  «  *  •  *  r 

r  i =  RuJr i 


AC  *A  '  S11M1  '  32r2 


S  .  *  3.R’Ld. 
rj  J  JJ  J-J  JJ  J 
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— 

— 

— 

An 

A12 

Bu 

Qu 

Qi2 

A  » 

A21 

^22 

■  Bi  " 

hi 

> 

Qi  - 

«i2 

*13 

V- 

t  • 

r> 

_ 

— 

In  general  by  leering  i*"0  in  Che  full  order  system  we  change  Che  meaning  of 
che  vector  z  from  a  state  variable  to  a  variable  which  depends  on  x.  So  if 
we  solve  the  resulting  slow  optimization  problem,  we  will  have  a  change  in 
information  structure,  lo  avoid  this  change  in  information  structure,  we  solve 
the  problem  as  an  output  feedback  problem,  where  we  constrain  the  feedback  to 
be  taken  from  x  and  z.  This  is  clearly  shown  in  Section  2.4. 

In  Che  following  sections  we  will  show  a  procedure  to  get  a  well- 
posed  solution  of  the  problem  depending  on  reduced  order  systems  while  both 
x  and  z  are  available  for  measurements  for  both  players.  Let 


lap 


•L,  ^  -  L7  2Z* 


Jir 


2ap 


'L21**L22Z- 


The  follower  will  find  L^->  by  minimizing  the  fast  part  of  his  optimization 
function  while  the  fast  part  of  the  system  is  given,  and  he  will  find  by 

minimizing  his  modified  slow  optimization  function.  The  leader  will  find  his 
gains  and  by  minimizing  his  slow  part  of  Che  optimization  function 
under  the  constraints  that  che  follower  applies  the  above  procedure  and  the 
slow  part  of  che  system  is  given. 
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2.3.  The  Fast  Optimization  Problem  for  the  Follower 

The  follower  can  find  Che  gain  by  minimizing  Che  fasc  pare  of 
his  performance  index  which  is 


^lf  *  ZfE(0^2  j  (zfQ13Zf  +u[fRllulf+u2fR12u2f)dc] 

given  chac 


*  dc  "  A22Zf +  B2lUlf +  B22U2f 


Ulf"’L12Zf 


U2f  *  "L22Zf  * 


SubsciCuCing  for  u^,  UTf>  we  8ec 


Jlf  =  z  (0)^2  ^  (zfQ13Zf +  L12RllL12  +  L22R12L22)dt:1 
^Zf  =  (A22  '  B21L12  "  B22L22)zf  =A22Zf  ‘ 


SoLving  Che  problem,  we  gee  Che  following  necessary  condicions 


L12  *R11B21K13 


(2.2) 


-^7 2^1 3  +’  2  R1 3B13RI3  k"1  "^*^13  ~  ^ 


(2.3) 


B13  =  B2lRllB21 ” 
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2.4.  Hybrid-Slow  Optimization  Problem  for  the  Follower 

The  follower  can  find  by  the  following  procedure.  Letting 
^-0  in  the  system  considered  we  obtain 


x  *A.,,x  +A,«z  +B,,u  +B,0u 

s  11  s  12  s  11  Is  12  2s 


°'A21*s+A22zs+a21Uls.+  B22U2s 


and  if  we  constrain  the  controls  to  be  of  the  form 


u,  *-L,.x  -  L, „z 
Is  11  s  12  s 


u2s  ”‘L21Xs  ‘L22Zs 


and  substitute  for  u^g,  we  obtain 

zs  * '(A22"B21L12  "  B22L22)  (A2l"B21Lll“B22L21)xs * 

Assuming  that  nonsln8ular  and  substituting 

for  u^s>  u?s,  zs  in  the  differential  equation,  we  obtain 

Xs  =  ^All’BllLll‘B12L2l" ^A12’B11L12_B12L22^ (A22’B21L12’B22L22^ 

~^A2l”B21Lll"B22L21^Xs 

or 

xs«Vs  .  xs(t)=?(t,0)x(0) 


where 


a  - 1^ 

A0=AU  "  A12A22A21 


A11  3 AL1  ‘  3ULL1  ’  B12L21’  A12  "A12  "  B  11^12  "  312L22 


j  '-^1  3 


'^7^  _  0  ~  ,,  *  a  V?  b-»  -i 
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Substituting  foru^,  u2s*  zs  in  C^e  °Ptimization  function  of  the  follower,  we 


obtain 


Jls  *  x^  2qJ  Xo^'(t,o)^Qll+LnRlfll+L21R12L21  ‘  (Q12  +  LilRllLl2+L21R12L22) 
A22A21  "  A21(A22)  '  (Q12  +  L12R11L11  +  L22R12L21)  +  A21(A22)  ' 

(Q^2  4*  ^“12 R1 1^*12  ^22R12^22 ^A22A21  ^ ^ »°) 3 xq^ ^  1  • 

Applying  the  same  procedure  as  in  the  output  regulator  problem  [12],  while 
using  the  assumption  that  E(xqx^)  »I,  we  obtain 

00 

Jls(Lll,L12)  *  ^trace  J  <e.0)[Qu  '  Qi2^22A21  -A2l(A22^  'Q12  + 

A21  (A22^  ^1 3A22A2 


where 


Q11  QU  +  LUR11LU  +  L21R12L21 


Q12  “  Q12  +  L11R11L12  +  L21R12L22 


Q13  ~  Q13  +  L12R11L12  +  L22R12L22  ' 


Finding  —  and  setting  it  equal  to  zero  we  obtain 
CL11 


L^*ll”R11^12A22A21  ~  R2l  ^A22  ^  ^12  ”  B21  ^A22 ^  ' ^13A’1 '|A'>  1  *  B '' i  (2.4) 


5l  =  3ll‘B21(A22L),(S12)' 


ar.c  K  ^  is  the  solution  of 
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K11A0  +  A0K11  +  ^11  "^12A22A21  _A2l(A22)  *^12  +A21(A22^  '^13^22^21  "  0 ' 

(2.5) 


Substituting  for  obtained  from  equation  (2.2),  (2.3)  in  equation  (2.4), 

(2.5),  we  can  find  Lj^. 

Comments : 

aiis 

(1)  Finding  r~  s  and  letting  it  equal  to  zero  will  lead 

OL12 

to  the  same  equations  as  (2.4),  (2.5).  This  is  due  to 

the  fact  that  z  (t)  is  a  linear  function  of  x  (t). 

s  s 

(2)  If  we  constrain  u^g  "  -LUXs  ,  and  apply  the  same  procedure, 
the  formulation  is  ill-posed. 


2.5.  The  Leader  Problem 

Before  describing  how  the  leader  can  find  L?i ,  L2?,  it  is  ad- 
Vutageous  to  change  the  form  of  equations  (2.4)  and  (2.5)  by  using  equations 
(2.2)  and  (2.3),  and  by  letting  to  be  of  the  following  form 


L11  '  Rll(3ilKll  +  B2lK12:i 


Then  after  some  straightforward  but  lengthy  algebra  equation  (2.4)  leads  to 


v12 


[Q12  +  L21RL 


^22  +  R11A12 


R11B12^22  +A21R13^A22 


(2.6) 


where 


S12  *B11R11321  ’  Sll  =  3llRli3U 


and  equation  (2.5)  becomes 
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K11A11  +  AUK11  ■K11S11K11  "K11B12L21  "L21B12K11  +  Q11  ’  K12S13K12  + 


K12A21+A21Ki2  +  L2lR12L21  “°* 


(2.7) 


By  substituting  for  u^,  U2S  »  2S  t*16  slow  part  of  the  leader's  optimization 
function,  we  obtain 


2s 


2  XE^  i  Xs^21  “^22A22A2l  "  A21  (A22^ '^22 +A21  (A22) '^23A22A21^Xs(t)dt:^ 


o  0 


where 


Q21  "Q21  +  L21R22L22  +  K12S22K11  +  K12S23K12  + K11S21K11 + K11S22K12 
^22  3 ^22  +  L21R22L22  +  K12S23K13  +  K11B22K13 
^23  * Q23  +  L22R22L22  +  K13B23K13 

^21  3BllRllR21RllBll  *  B22  *B11R11R21R11B21’  B23  = B21R11R21R11321 

Let  J-  =  £  [7  x' (0)K,x(0)]  ,  where  K-  satisfies 

Z  x(0)  1  1  Z 

AQR2  +  R2A0  +  ^21  "  ^22A22A21  ”  A21  ^A22^  ^22  A21  ^A22^  *^23A22A2l  *  ^  " 


(2.8) 


So  the  leader  has  to  minimize  under  the  following  constraints 


A22R13  +  R13A22  +  R13313R13  +  ^22R12^22  +  ^13  =  ^ 


KllAll  +  AnKll  "  K11S11K11  "  K11B12L21  ’  L21B12‘<11  +  Q11  *  n12S13N12  “ 


AQR2  +  R2A0  *^21  _^22A22A,1  "A2l^A22^  ^22+A2i^A22^  ^23a22‘a2] 
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where 


K 


12 


’ o  iB.-l. 


12  21  12  22  11  12  11  12  22  21  13J  22 


+a-  k.  ja:1 


The  reader  is  referred  Co  Appendix  A  for  Che  derivacion  of  Che  necessary 
condicions  for  Che  leader's  minimizaCion  problem. 


2.6.  Full  Order  Problem 

In  equaCion  (2.1)  we  assume 


Mil 

mi2 

Nil 

Ni2 

II 

™i2 

"Mi3_ 

Ni“‘ 

_Ni2 

Ni3_ 

Subscicucing  for  in  equacion  (2.1a)  and  leccing  W~0,  we  obCain  Che  following 


A '  M 

cir  Li 


m12(0) 


A  1  M 

c22*13 


(0)  +  Mu  (0)Ac  u  +  A '  2  lM  '  2  (0)  +  Mu  (0)Ac21  +  Mu  (0)  (S^^ 

+m12(0)(s'2mu(0)+s13m|2(0))+f^1(0)r12f21(0)  +qu 

••^12  +  F21(0)R12F21(0)"<A2r?12MU(0)-322F2l(0)),Ml3 
-mu(0)b12f22(0)]a;212 

(0)  +m13(0)ac22  +  m13(0)s13m13(0)  +f42(0)r12f22(0)  +q.3 


(0)+S12M12 
=  0 

(0)  +M11A12 


=  0 


(0)) 

(2.9) 

(2.10) 
(2.11) 


where 


AcLL  *All'Sir!ll(0)‘S12M12(0)“B12F21(0) 

AcL2  =A12'312M13(0)'B12F22(0) 
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Ac21  *  A2rSi2Mll<°> ‘Sl3Mi2  <°> -B22F21  <°> 

Ac22  ■A22"®13?*13^"B22T22^ 
assuming  that  ^^2  i*  nonsingular. 

It  is  noticed  that  equations  (2.3),  (2.6)  and  (2.7)  ere  identical  to  equa¬ 
tions  (2.11),  (2.10)  and  (2.9)  respectively  where  11^(0),  1^(0),  Ml3(0),  F2l(0), 

F22 ^  replace  ^12*  R13*  ^21*  ^*22* 

Substituting  for  in  equation  (2.  lb)  and  letting  H"*0,  we  obtain 

M2l(0)Acll  +  AklM21(0)+^2<0)Ac21+Ac22^2^)+Q2l+F21(0>R22F21(0)+Mll(0) 

(B21Mll(0)  +  B22Mi2(0>)+M12(0)(B22Mll(0)+B23Mi2(0)>  “°  (2*U) 

H21(0)Acl2+^2(0)Ac22+Ac21M23(0)+MU(0)i22M13(0)+M12(0)i23M13(0)  + 

F21(0)R22F22(0)  +Q22  ”  0  (2,13) 

Ac22M23(0)  +M23(0)Ac22+M13(0)B23M13(0)+F22(0)R22F22(0)  +Q23*0’  (2*14) 

From  (2.13)  we  have 

M22"‘tAc21M23(0)+M21(0)Acl2+Mll(0)S22M13(0)+M12(0)S23Ml3(0)+F2l(0)R22 
F22^)  "r^22^Ac22  ' 

Substituting  for  M77  in  (2.12)  and  using  equation  (2.14)  we  obtain  an  equation 
identical  to  (2.8),  where  ^(0)  ,  >1,  ^  (0) ,  M,,7(0),  M,3(0),  Fol(0),  Foo(0)  replace 
K7 ,  K.  i^7,  K,  ^  >  !■■■>]_>  L77  respectively. 
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Decomposing  equation  (2. Id),  and  letting  h>-0  w*  obtain 


AcllN21(0)  +  N21  (0)Acli  +  Ac12N22  (0)  +N22  <°>Ac12  + 1  "  0 
N2l(0)Ac2l  +  N22<0>Ac22-0 


(2.15) 


(2.16) 


Ac21N22  +  N22Ac21  +  Ac22N23  +  N23Ac22  "  °* 

Froa(2.16),wegetN22(0)  «-N21(0)A^21(a"J2)'  «N21(0)V\ 
From  (2.17),  we  getN23(0)  -VN2l(0)V' 


where 


V  “"Ac22Ac21  * 


(2.17) 


Subscituting  for  N22  in  (2.15),  we  obtain 

(Acll‘Acl2Ac22Ac21)N21(0) +  N21(0) *Acli“Acl2Ac22Ac21* '  +  1  "  0 

which  is  identical  to  (A. 2)  where  N2l(0),  Mu(0),  Mu(0),  M13(0),  F21(0),  F,,(0) 
replace  P2>  Ku>  K^,  K13>  L2l,  L2,  respectively. 

After  decomposing  (2.1c)  ,  and  letting  ji  w0,  we  obtain 

CAcllNU(0)  +Acl2Ni2(Q)'(illM21(0)  +Si2M^(0))N21(0)-S12M23(0)N'2(0)  + 

(^2lMll(0)+^22Mi2(0))N2l(0)+^22Ml3(0)N22(0)]  +  CNU(0)Acll +N12  (0)Acl2 
-N'.,1(0)(M2l(0)Sll+M22(0)S{2)-N22(0)M23(0)S'2+N2l(0)(Mu(0)S21+Ml2(0)Sj:) 


+  n22(0)M13(0)S22]  =0 


(2.18) 


N’il(0)Ac21+N12(0)Ac22+N21(0)(M21(0)S21  +  M22  (0)S13} '^22 (0)M23(0)Sl3 


+  Nn(0)(Mu'0)S22+M12(0)S23>  +  N2,(0)M13(0)S23  =  0 


(2.19) 


Ac21N12(0)"Ac22N’l3(0)  +  Hi  (0)Ac21  *  N13 (0)A c 22'T3N22  (°)  ’N22  (0)T3 


"V23(0)“i*23(0)T4  ‘0  ’ 


(2.20) 


24 


From  (2 . 19)  we  obtain 


where 


N12(0)  “  N2l(°>W'  +  NL1(°)V 


W  -  a'J2(t3  +  T4V) 


t3  -  s13m^2(0)  +  s{2m21(0)  -  s23m'2(0)  -  S22M1L(0) 
t4  -  S^Jt^O)  -  . 


Substituting  for  N12(0),  1122(0),  N22(0)  in  (18),  we  get  an  equation  identical  to 
(A.l)  where  Nu(0),  N21(0),  M^O),  Mu(0),  F2l(0)f  F22(0),  ^(0),  M12(0) 

replace  ^2*  1  ^11*  ^21*  *  ^13*  ^12*  tespectively. 

Substituting  for  N12(0),  ^3(0),  M22  ^  in  (2.20)  and  using  equation 
(2.14),  we  get  an  equation  identical  to  (A. 5),  where  N33(0),  11^(0),  N21(0), 
^21  >  ^13(0)  >  ^2^(0),  replace  P3,  P^,  P^,  K^,  K^, 

K12’  K13’  L21’  L22  resPectively- 

Decomposing  (2 .  le)  and  letting  ^-0,  we  have 

R12  (F21  <0)N11<°>  +  F22  (0)N12  (0) }  +  R22  (F2l  (0)N21 (0)  +  F22  (0)N22  (0) >  “Bi2MH (0)NL1  (°) 
"B22  (M12  (°)Nu(0)  +  M13(0)N12  (0)  }  "B12M2l  (0)N2l (0)  'B22  (lM22  (0)N21 (0)  + 


M23 (0)^22  (0)) 


(2.21) 


(R12F21 (0) "312Mli(0) *8:2-I12 (0) )N12 (0) +  (R22F2l (0) 'B12M21 (0) 'B22M22 (0) >N:: (0) 

*B',2‘113^13^^  B.J  „M03  (0)  ]  N,,  3  (0)  =  0.  (-.22) 
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Substituting  for  N^2(0),  N22^’  M22^’  and  N22(0)  in  equations  (2.21)  and  (2.22) 
and  using  aquation  (2 . 14)  ,  wa  gat  aquation  (2 .21)  idantical  to  (A.  3)  and  equation 
(2.22)  identical  to  (A. 4)  where Nu(0)  ,  Nn(0),  ^(0),  Mu(0),  M^O),  11^(0),  F21(0) 

F22  )  *  atJd  *^3  )  replace  P^»  ?2»  ^2*  ^11*  ^12*  ^13*  ^*21*  ^22*  and  ^3 

respectively . 

To  compare  the  performance  indices  resulting  from  solving  the  full 
order  problem  with  the  ones  resulting  from  the  reduced  order  solution  we  need 
the  following  assumptions: 

a.  The  fast  optimization  problem  of  the  follower  has  a  unique  stabilizing 
solution.  In  other  words  there  exists  a  unique  which  is  a  solution 
of  equation  (2.3)  for  each  L22  applied  such  that  \(A22)<0. 

b.  The  slow  optimization  problem  of  the  follower  has  a  unique  solution  after 


substituting  for  L12  from  the  fast  problem,  i.e.  equations  (2.6)  and  (2.7) 
have  a  unique  solution  for  K^,  K^. 

c.  The  leader  optimization  problem  has  a  unique  stabilizing  solution,  i.e. 

there  exists  a  unique  pair  and  as  a  solution  of  the  set  of  equations 
(2.3),  (2.6),  (2.7),  (2.8)  and  (  A. 1)  through  (A. 5)  such  that  X.(AQ)  <0. 
Theorem:  If  assumptions  (a),  (b),  (c)  are  satisfied  then: 


(1)  lim(u.-u.  )  ■  C 

r.-0  1  laP 

(2)  lim  (J.-J*)  =  0 

u  -  0  1  1 


for  i  -  1,2 


vnere 


u.  *  -L..X-L.-Z,  u.  =  -F. , >:-F. _z . 

tap  il  i2  1  2.1  i2 

J 'f  is  the  performance  index  when  u^  and  u0  are  used. 


1 


J.  is  the  performance  index  when  u,  and  u„  are  used. 
1  lap  2a? 
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Proof:  (l)  It  was  shown  that  Mu<0),  M^O),  M^CO),  M^CO),  Nu«)),  N2l(0), 

N13<0)’  F21(0)*  F22<0)  rePlace  Ku»  Ki2»  k13»  V  Pi>  p2*  p3»  l2i>  L22  resPecti-vel-y 

in  the  equations,  and  if  the  uniqueness  assumptions  are  satisfied, 

then  we  have  unique  values  of  Kj^,  K12>  K13»  K2»  P].»  p2»  P3*  l2i»  L22  aad 

,  ^i2  *^i2 »  ^13  “^3(0) ,  P2i^)"^2]i»  f22^^“^22»  K2-M2i(0), 

P1-Nu(°),  P2-H21(0),  P3-Ml3(0>. 

For  the  follower: 


uUp*-LUx-L12z- 


Substituting  for  L^,  we  obtain 


Ulap  "  “Rll(BUKll  +  B2lKi2)x  "  R11B21K13Z 
But  the  exact  control  of  the  follower  is: 


-1  B21 
'RlltBll  ri  ^ 


M11 

^12 

+«i2 

-M13 

X 


r*Tn  • 


”R11^B11^11X  ®21  ^13Z^ 


-1 


'RlJ  (3UMll+32lM12)x+  ^B11M12z  +  B21M13zJ 


Clearly , 


For  the  leader  : 


lira  u,  3  11m  u, 

0  1  .-0  lap 


*  y  •  P  2 

9  7  ‘  -» 9 


u2ap  * "l21x  '  U22Z 


Clearly,  1-m  u,  3  in  u. 


.“0 


_  2ap 

-  J  r 
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(2)  When  the  exact  controls,  u^*-F^y  and  ■  -F7y  are  used  the 
resulted  performance  index  J*  w^s^e  is  given  by  equations  (2.1a), 

(2. lb). If  uLap>  a«d  u2ap  are  used»  where 

“up ■  -RuBiKii'  •  hm  fKu  °~] 


«i2  ^13 


u2ap  *  -V 

we  will  have  as  the  performance  index,  where  “^g^yQ*  and  W]_>  W2  saci-sfy 
the  following  equations 

Wl(A  -  SuKl  -  B2L,)  +  (A  -  SuKl  -  B2L2)  •Wl  +  Qj_  +  KjS^  +  L^R17L2  -  0 

(2.23) 

W2 (A  -  SL1KL  -  B2L2)  +  (A  -  SUK1  -  B,L2)  'W2  +  Q7  +  K^jKj.  +  L^R.,-,^  »  0  . 

(2.24) 

Subtracting  (2.23)  from  (2.24)  and  (2.1a)  from  (2 . lb)  we  find  that 

and  P7  »W7  -  satisfy 

Pl(A  -  SUK1  -  B2L,)  +  (A  -  SuKl  -  B2L2)  ’PL  +  (Kj_  -  ' S L L (K  -  M^)  -  F7) 


■(l2  -  f7)  'b;m1  +  l^r17l,  -f^r1?f7  =*  0 


(2.25) 


?7(A  -  S,1K1  -  B7L7)  +  (A  -  S11K1  -  3:L2)  P7 + KlS2lKL  “  MlS:lMl  '  M2S11(KL  '  V 
-(Kl  -M1),S11M2  -M737  (L2  -F:)  -  (L,  •:2)'3pi2rL:R„L:  -  F4R,,F„  =0.  (2.26) 


'  ?il 


■U?12  I 


Taking  P. 
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and  substituting  for  P^  and  P7  in  equations  (2.25)  and  (2.26)  respectively,  and 


setting  i^~*0,  we  obtain 


PilAll  +  P12A21  +  AllPil  +  A21Pi2 


(2.27) 


PilA12  +  Pi2A22  +  A2lPi3  *  0 


(2.28) 


Pi3A22  +  A22Pi3  *  °* 

Since  is  stable,  Pjj  *  0  is  the  unique  solution  of  equation  (2.29) 


(2.29) 


From  (2.28)  we  have 


Pi2  "  "  ^PilA12^A22 


Substituting  in  (2.27),  we  obtain 


Pil^All  ’  A12A22A21^  +  ^All ~ A12A22A21^ ' Pil *  ° 


°r  P^Aq  +AgPil  3  0.  Since  Ag  is  stable,  P^**).  Thus 

P^  *  0  for  i»l,2,  j  =*  1,2,3 

Remark  1;  In  the  LF  game,  the  leader  announces  his  strategy  first,  and  the 

follower  reacts  by  playing  optimally,  i.e.,  the  follower  chooses  a  strategy  which 

lies  on  his  reaction  curve.  So  if  the  Leader  uses  u«  ,  then  the  follower  has 
*  sap 

to  respond  by  choosing  u^*u^  (^  ) .  our  case  c^e  follower  does  not  choose 

u?.  but  he  chooses  u.  ,  hence  the  rules  of  the  game  are  violated.  This 
i.  lap 

deviation  from  the  basic  definition  of  the  equilibrium  solution  of  the  game  can 

be  tolerated  for  computing  the  optimal  strategies,  if  the  resulting  deviation 

of  the  leader's  and  follower's  performance  indices  from  their  corresponding 

exact  LF  performance  indices  is  small.  In  other  words,  the  use  of  u,  bv  the 
r  uo 
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follower  instead  of  can  be  tolerated  as  long  as  ^^(uiap>u2ap)  tends  Co 
«u2ap)  as  h*  tends  to  zero.  This  can  be  easily  shown  to  be  the 
case  by  using  a  method  which  is  equivalent  to  the  one  given  in  the  theorem. 

We  will  not  pursue  this,  any  further  in  this  chapter. 

Remark  2:  In  this  chapter  and  in  [56],  we  consider  a  different  information 
structure  from  the  one  given  in  [11].  In  [11]  the  space  of  admissable  strate¬ 
gies  is  restricted  to  be  taken  from  the  slow  variables  only,  and  the  usual 
singular  perturbation  technique  is  used  to  obtain  a  well-posed  solution.  But 
in  [56]  and  this  chapter  a  more  general  information  structure  is  considered, 
since  both  the  slow  and  fast  variables  are  available  for  measurement,  and  a 
new  procedure,  which  depends  on  preservation  of  the  information  structure  of 
the  full  order  game,  is  used  to  obtain  a  well-posed  solution. 


CHAPTER  3 


LEADER-FOLLOWER  TEAM  STRATEGIES  FOR  SINGULARLY  PERTURBED  SYSTEMS 

3.1,  Introduction 

In  this  chapter,  we  develop  a  well-posed  procedure  to  obtain  low  order 
and  near  optimal  LF  team  equilibrium  strategies  for  systems  with  slow  and  fast 
modes.  In  Section  3.2,  we  find  the  limiting  behavior  of  the  full  order  game  as 
u  tends  to  zero.  In  Section  3.3,  we  solve  two  reduced  order  games.  First,  we 
give  sufficient  conditions  for  the  existence  of  solutions  for  the  fast  LF  team 
problem.  Secondly,  we  give  sufficient  conditions  for  the  existence  of  solutions 
for  a  hybrid  slow  (hybrid  because  the  fast  gains  are  imbedded)  LF  team  problem. 
The  basic  property  of  this  reduced  order  hybrid  slow  game  is  that  it  has  the  same 
information  structure  as  that  of  the  full  order  one.  In  Section  3.4,  we  show 
that,  under  certain  conditions,  the  reduced  order  games  and  the  full  order  one 
are  equivalent  in  the  limit  as  u  tends  to  zero.  In  Section  3.5,  we  apply  our 
well-posed  procedure  to  the  case  when  the  leader  uses  strategies  with  finite 
dimensional  memory  and  solve  a  numerical  example. 

3.2.  Full  Order  Problem 

Let  us  consider  the  singularly  perturbed  lir.ear  time-varying  system 
x  =  A11(t)x  +  A^„ (t)x  +  B^tju^  +  B17(t)u?  ;  x(0)  *  xq 

-z  =*  A,, ,  ( t )  x  -1-  A„?(t)z  +  3?1(t)u  +  (t)  u?  ;  z(0)  =  zq 

nl  n-> 

where  x€  R  :  z£R  “,  u^eR  for  i*l,2,  v  is  a  small  positive  parameter, 

and  A...  A  ,, ,  A«,7  ,  A?^ ,  B  ^ ,  3^,  B,^.  3,,,,  are  continuous  in  t  for  all 

tS  ft  ,  t .  ]  .  Assume  that  the  ith  olaver  wishes  to  minimize  the  following  function 
L  ~o  z 
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■Vvy 


,  t 

2  /  (y'Q^Oy +  uiRii^t^ui+ ujRijuj^dt;  for  i“1»2 


where  y 


x 


2  J 


;  <^(0 


Qu(t)  Qi2(t) 


,  which  is  a  symmetric 


fJjftxyJT' 

ft; 


definite  matrix,  and  R^,  R  are  symmetric  positive  definite  matrices .  Let  us 
take  player  one  as  the  leader  and  player  two  as  the  follower.  The  procedure 
to  solve  the  team  leader-follower  game,  with  memory  in  the  control  structure 
is  as  follows  [18] : 


1.  Solve  the  leader's  problem  as  a  control  problem  with  J^(u^,u?)  as  the 
objective  function,  and  U^»u2  as  controls  under  the  constraint  of  the 

i 

state  equation.  The  optimal  controls  are  (u  /i^)  ,  where 

^(t)  *  _RiiBiKy 
u2(t)  *  _Ri2B2Ky 

and  K  satisfies  the  following  Riccati  equation 

k  +  ka  +  a'k  +  q.  -  kib.rTJ’b'  +  b_r7^b’]x  -  o 

1  1  IX  1  L  ik.  - 

and 

K(tf)  *  0. 

Define  y  (t)  as  the  resulting  trajectory  when  the  controls  u  (t),u?(t) 
are  applied. 

2.  Consider  the  function  u,  ,  which  is  represent  ..i  bv  a  Lebes gue-Stielt i es 

X 

integral  of  the  following  form 

t 

u,  (t)  =  ■  d  *i  ( t ,  s )  v  (  s  i  . 

X  s 

t 

o 
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such  Chat 

u^(t)  -  u^Ct) . 

Then  with  u^  in  the  objective  function  (J^Cu^.u^)),  of  the  follower  and 
in  the  state  equation,  minimize  ^(u^,^)  with  respect  to  ^  and  find 
conditions  such  that  if  the  optimal  (u^,^)  are  applied  then  the  resulting 
trajectory  will  be  y  (t) .  If  these  conditions  are  satisfied  then  (u^,"^) 
constitute  a  leader- follower  strategy  pair.  These  conditions  are  stated 
in  [18],  but  we  restate  them  here  for  completeness. 

If  there  exists  a  function  n(t,9)  with  n(t,3)*0  for  9>t  and  (n^-H^^n^-h^) 
matrix  P  which  satisfy 


/  dsn(t,s)y  (s)  *  -R^1B^K(t)v  (t) 

te[t0,tf] 

(3.2) 

t 

o 

R~2(t)B2(t)P(t)  *  R*2(t)B2(t)K(t) 

t6[to,tf] 

(3.3) 

t  - 
t 

P(t)y  (t)  -  /  (Q2  +  A'P  +  n'(t,t)F(T))y  (t)dx  =  0 

t 

t€[t0>tf] 

(3.4) 

where 

F(t)  =  R21(t)R“2(t)B^(t)K(t)-B^(t)P(t) 

(3.5) 

and  y  (t)  satisfies  the  following  linear  differential 

equation 

?  -  (A(c)-B1r’Jb^K-B2r“Jb^P)7  (t) 

(3.6; 

v(  t  )  *  V 

0  "  0 

then 
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*  W 

it 


S  f 


c  f 

u^t)  »  /  dgn(t,s)y  (s) ' 


u2(c)  -  -R^CtWtJy  (t) 

are  optimal  leader-follower  strategies.  Furthermore  they  are  also  the  team 


solution. 


Preliminary  analysis  indicates  that  the  following  forms  are  useful 


for  our  problem 


K(t,u) 


Kx(t,u)  yK2(t,p) 
uK^(t,u)  uK3(t,y) 


;  Q 


Qil  Qi2 


Qi2  Qi3 


n(t,s,y)  -  (n^t.s.u)  n2(t,s,u)J 


F(t,p)  -  [F^t.y)  F2(t,y)] 


P(t,u) 


PL(t,y)  yP2(t,u) 
yP. (t,y)  yP, (t,u) 

J  *4  - 


we  substitute  the  forms  of  K(t,y),  Qi>  n(t,s,y),  F(t,y),  and  P(t,y)  as  given 
above  ici  equations  (3.1)-(3.6),  decompose  them,  and  take  the  limit  as  -  tends  to 
zero.  The  reader  is  referred  to  Appendix  B  for  the  resulting  equations. 

Several  remarks  are  made  regarding  the  above  procedure. 

?.l :  The  state  vector  y»£*j  in  equations  (3.2),  (3.4),  and  (3.6)  is  decoupled 
into  fast  and  slow  subvectors  by  using  a  transformation  due  to  Chang  [25] 


For  a  precise  description  of  the  function  ",(t,s)  the  reader  is 
referred  to  [18]. 
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So  the  control  memory  of  the  leader  Is  decomposed  into  a  memory  of  the 
slow  modes  and  a  memory  of  the  fast  modes. 

R2:  In  general  the  matrix  P(t,u)  is  nonsymmetric ,  but  it  is  symmetric  for 
some  special  n(t,s)'s. 

R3:  After  decomposing  equation  (3.4) ,  and  letting  p-*-0+,  we  get  as  one  of  the 
equations 

Cf 

/  (Q22  + A^P^Ct.O)  +  n^(T,t,0)F2(T,0))'?22(T,t,0)dt  *  0. 

But  since  the  state  transition  matrix  has  full  rank  the  above  equation 
implies  (B.  16) ,  and  the  same  situation  applies  in  (B.17)  and  (B.18). 


3.3.  Reduced  Order  Games 

The  class  of  leader-follower  games,  which  is  considered  in  the 
previous  section,  will  also  be  considered  in  this  section  but  for  the  case 
in  which  the  reduced  order  systems  are  given.  The  fast  subsystem  is 


uz. 


A22Zf  +  B21ulf  +  B22u2f 


and  the  fast  part  of  the  objective  function  is 

t„ 


if 


2  [  <ZEQi3Zf  +  UIfRiiUif  +  fRijujf )dt 


o 

Sufficient  conditions  for  the  existence  of  an  optimal  leader-follower  fast 
strategy  pair  with  memoir/  which  coincides  with  the  team  fast  strategy  pair 
are  given  in  Appendix  C. 

If  we  let  a  —  CT  in  the  original  system,  then 


x  3  A. . x  +  X  2  +  B.,u,  +  B._u_ 

s  11  s  12  s  11  Is  12  2s 


0  "  A21Xs  +  A22Zs  +  321u1s  +  322U2s 
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Assuming  that  ^2^0  is  nonsingular  for  all  t€[to>t^],  then 

2s  ■  -A22(A21XS  +  B21ul.  +  B22u2») 

Eliminating  z  ,  we  have 
s 

A  A  A 

x  ■  A-.X  +  B,,u.  +  B,,,u- 

s  11  s  11  Is  12  2s 


where 


A11  *  A11~A12A22A21 ;  ®11  *  B11-A12A22B21 


B12  "  B12~A12A22B22‘ 


The  slow  part  of  the  objective  function  is 


2  {  [xs^llxs  +  xs^i2uis  +  uisQi2Xs  +  xs^i2ujs  +  ujs^i2xs  +  uisRiiuis 
o 

+  u!  R  .  u  +  u !  Q.  .u.  +  u '  Q.’^u.  ]dt 
js  ij  js  isxi3  js  jsxi3  is 


where 


Qil-Qi2A22A2l"(A22A21)  Qi2 +  (A22A21}  Qi3A22A21 
*Qi2A22B2i+  (A22A21)  Qi3A22B2i 


-Qi2A22B2j  +  (A22A21}  Qi3A22B2j 


B2i(A22)  Qi3A22B2j 


Rij  +  B2j(A22)  Qi3A22B2j' 


Sufficient  conditions  for  the  existence  of  the  leader-follower  teamhybrid  s low- 
solution  are  obtained  as  in  [18].  We  have  the  following  facts  and  conditions. 


1.  (u^s,u?s)  are  c^e  ceam  slow  optimal  controls,  where 
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“la  ■  ■®llt^12xs  +  °ilKsX3+  ^13“2s' 


u2s  ’  ~^12  ^12xs  +  ^13uls  +  ®12Ksxs1 


where  xg  is  the  resulting  trajectory  when ulg>  u2g  are  applied, 
u^  and  iT2g  can  be  expressed  in  the  following  form 

(T.  »  -M.  x  ,  for  i*l,2, 

is  is  s  *  * 


and  K  satisfies 
s 


K  +  $11  +  KS  (AirillMls"812M2s }  +  (An-§llMls-§12M2s)  ,Ks-VMls-Mis^2 
■^12M2s'M2s^12  +  Mls^llMls  +  M2s*12M2s  +  M1s^13M2s  +  M2s^lsMls  "  0 


K  (tf)  -  0. 
s  r 


The  leader  applies  a  control  u^g  of  the  following  form 
t  _i  t 


u1o(t)  -  /  (dana(t,s)-d<5nf(t,s,0)A,7A,1)x(s(s)-/  denf  (t,s  .OjA^B^u^Cs) 


22  21  s  >  s  f' 
o 


-f  d=nf(t,s,0)A22B22u2g(s). 


This  specific  form  of  the  leader's  control  is  chosen  so  as  to  preserve 
the  information  structure  of  the  full  order  problem  in  the  limit  as  u  tends 
to  zero,  since  a  closer  look  at  its  form  reveals  that  u^g  can  be  expressed  as 

t  t 

u.  =  /  d  n  ( t ,s)x  (s)  +  /  d  r,  (t,s)z  (s) 
ls^ss  s  c  s  t  s 

O  0 

where  z^  is  the  slow  part  of  the  z-state  vector.  In  the  full  order  problem 
the  leader  has  access  to  the  past  history  of  the  trajectory,  but  in  the 
reduced  order  problem,  he  has  access  to  the  history  of  the  slow  part  of 


the  trajectory,  his  strategies  and  the  strategies  of  the  follower.  Also, 
the  leader  is  not  using  a  standard  slow  control  since  he  is  using  the  fast 
gain  ~ij(t,s)  which  is  obtained  from  the  solution  of  the  fast  game.  The 
use  of  the  fast  gain  is  necessitated  by  the  preservation  of  the  information 
structure. 


.  sufficient  condi_iuns  tor  the  ^xijtence  of  the  hybrid  slow  leader-follower 
solution  are  given  in  Appendix  D. 

.  If  is  not  used  in  2  above,  the  equations  of  the  reduced  order  games  will 
not  correspond  to  the  full  order  ones,  as  will  be  clearly  explained,  later 
in  this  chapter. 

з. 4.  The  Correspondence  Between  the  Reduced  Order  and  the  Full  Order  Games 

This  section  contains  four  lemmas  which  show  the  correspondence 
between  the  reduced  and  the  full  order  games,  and  prepare  for  the  main 
theorem  which  describes  the  procedure  to  obtain  a  well-posed  and  near  optimal 
strategies  depending  on  the  reduced  order  subsystems. 

Before  stating  the  lemmas,  let  us  form  the  composite  controls  u.  (t), 

lc 

и.  (t)  as 
zc 


u  <t)  *  uis  +  uif  +  0(y),  for  i*l,2 

V(t)  ■  -MisVRnB:iKlf(':-0)2f<':,+0(;‘) 
u2c(t)  *  ■M2sVRUB22Klf(C,°)2f(t)+0lj)- 


After  using  the  same  maniDulations  as  in  [9],  we  get  u,  ,u„ 

lc  -C 


of  the  following 
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“lc  ■  -VBU 


-1  ®31  K.(t) 


uK;(t)  uKlf(t>0)J  [z 


where 


,  B ’  K  (t)  0  x 

->  (t.0)  , 

m  it 


Km(C)  *  •A22lKsA12  +  A21Klf(t’0)+Q12"Ks§2Klf(t*0)] 
A22  "  A22"^3Klf(t’0) 

B2  ’  BllRUBil  +  B12Rl2B22 


S3  ’  B21R11B21  +  B22R12822  * 


In  [17],  it  was  shown  that  if 

1*  Ai j, 9 AI2 * A21 9 A22 f ^11 9 ^12 9 ^21 9 ^22  continuous  in  t, 

2.  are  continuous  in  t  for  i»l,2,  j*l,2,  and  i#j  , 

3.  Equation  (C.4)  has  a  unique  solution  as  ~  tends  to  zero. 

4.  The  fast  subsystem  is  asymptotically  stable. 


1  im  [  K.  ( t ,  u )  -K-  .  ( t ,  u )  ]  -  0  Vt£[t  ,tj 

-,-0  3  lf  °  f 


lim[K  (t,u)-K  (t) ]  -  0 
u-0  2  m 


vtetto.tf) 


lim[K  (t,u)-K  (t) ]  =  0 
u-0  1  3 


vte[t  ,t.i 

o  t 


and  as  a  result  we  have 


39 


lim[u. (t,u)-u,  (t,p)]  -  0  for  1*1,2,  and  for  all  te[t  ,t,) 
U-K)  1  ic  °  f 

where 

u^t.u)  *  -R^B^Ky 
u2(t,u)  - 

The  relevance  of  these  results  to  our  problem  will  be  clear  later. 

Lemma  3.1;  Let 


3 

si  - 

[B11 

B12 

VJ 

o 

5 

52- 

tB21 

B22 

'RiiB2iKif(c'0)" 

;  uc  * 

Ulc 

LU2c. 

u 

s 


G  x  : 
o  s 


G2Zf  * 


Form 


[  ( I  +  G2A22B 2 ) Gq  +  G2A22A21 ] x  +  G2z . 


If  us>  u^,  are  applied  to  the  slow,  fast,  and  the  full  order  systems 
respectively,  and  if  (A72+§2G2)  Is  asymptotically  stable,  then 

lim(x(t)-x  (t))  *  0  Vte[t  , t] 

u-0  3  °  f 

lim(z(t) +a:J(A71+B,G  )x  (t)-z.(t))  *  0  Vte[t  t  ) 

Z2  21  2  o  s  f  of 

u—O 

where  xs(t),z.(t)  are  the  resulting  trajectories  after  applying  ug  and  u 
the  slow  and  the  fast  subsystems  respectively. 

Proof:  When  we  apply  u^  to  the  full  order  system  we  get 


40 


A11+51(I+G2452)Go+51G2A22A21  A12+hG2 

iA22+52G2>A22(A21+“2Go)  A22+52G2 


It  is  clear  that  the  present  system  is  exactly  equivalent  to  the  system 
described  by  equation  (B.8)  before,  since  equal  controls  are  applied  to  the 
full  order  system.  Using  the  usual  singular  perturbation  techniques,  see 
f 17] ,  we  get 


lim(x(t)-x  (t))  -  0  Vte[t  ,t_] 

»  s  or 

u-K) 

lim(z(t) + A  *(A-.+B,G  )x  (t)-zf(t))  -0  Vte(t  ,t,). 

n  n  ii.  l  os  t  of 

u-K) 

After  noting  the  equivalence  between  the  present  system  and  the  system 
described  by  equation  (B.8),  it  is  easy  to  see  that 


lim(ii)  .  (s,t,u)-4>  (s,t))  *  0 

n  xl  S 

u-K) 


lim('|)22(s,t,u)-'l>f(s,t,u))  -  0 
u-K) 


where  :j;^^(s,t,u)  and  ^22(s,t,u)  are  the  state  transition  matrices  described 
by  equations  (B.li)  and  (B.12)  respectively. 

Lemma  3.2;  If  in  the  limit  as  u  tends  to  zero,  equation  (C.4)  has  a  unique 
solution,  and  there  exist  unique  values  for  ne(t,s,0),  K_  (t,0)  which  satisfv 

i.  L  ~ 

r 

equations  (C.I),  (C.2),  and  (C.3),'and  if  the  fast  subsystem  is  asymptotically- 

stable  i.e.  \(A_^)  <0  then 
«■ 

:<1  (t,0)  -  :<3(t,0);  nf(t,s,0)  *  q2(t,s,0):  K2  (t,0)  =  ?4(t,0) 

F . (t ,0)  =  F-(t,0);  S,-(r,t,0)  *  >oo(r,t,0) 

X.  L.  L.  *»  *» 


where  K3,  P4>  F2*  411(1  ^22  are  defined  in  Section  3.2  and  Appendix  B. 

Proof:  If  we  let  y-*0+  in  equations  (C.l),  (C.2),  (C.3),  (C.4),  (C.5),  and 

(C.6)  of  the  fast  game,  and  then  compare  them  with  (B.14),  (B.5),  (B.18),  (B.3), 

(B.7)  ,  and  (B.12)  of  the  full-order  game,  we  notice  that  the  first  set  of  equations 

are  exactly  equivalent  to  the  second  set,  where  K.  (t,0),  nf(t,s,0),  K»  (t,0), 

if  r 

Ff(t,0),  <j>f(x,t,0)  replace  K3(t,0),  n2(t,s,0),  P4(t,0),  F2(t,0),  ^22(t, t,0) 
respectively.  The  uniqueness  assumptions  stated  in  the  lemma  are  sufficient 
for  equality  of  the  above  terms.  The  stability  condition  is  necessary  and 
sufficient  for  the  boundedness  of  iji^(r,t,0). 

L*mma  3.3:  If  the  assumptions  of  Lemma  3.2  are  satisfied,  and  if  there  exists 

unique  values  of  n  (t,s),  P  (t)  which  satisfy  equations  (D.l),  (D.2),  (D.3) 

s  s 

and 

Q22  +  A2iK2  (?.°)  +  Hg(T,t)Ff(T,0)  -0  ...  Vt<T  (3.12) 

then 

P.  (t,0)  «  P  (t)  ;  n.  (x,t,0)  -  n  (r,t)  and  d  n1(:,t,0)*dn  (t,t)^ 
where  and  are  defined  in  Appendix  B. 

Proof:  Comparing  equation  (B.6)  of  the  full  order  game  with  equation  (D.4) 
we  notice  that  by  adding  the  term  (Ff-R2iRnB2lKl  'BnK?  >A2? (A^-B^M^-B^M^)  , 

it 

In  general  if  f^Ct),  f(t)  are  differentiable  almost  everywhere  (a.e.) 

dr  (t)  f(  . 

for  all  n,  and  f  (t)-fcf(t)  pointwise,  this  does  not  imolv  that  — - - '  — 

n  ^  -  at  at 

a.e.  for  example,  take  f  (t)  sinnt. 

n  v  n 
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which  is  equal  to  zero,  to  equation  (D.4)  ,  we  get  an  equation  equivalent  to 
(B.6)  ,  after  using  the  identity 


Mls+RilB21Klf(t,0)A22(A2l"B21Mls”B22M2s)  '  RU(BilKl(t’0)  +  B’^Ct.O)) , 
and  substituting  for  P3(t,0)  - -(A^*)  *  [Q^+A^P  (t.OJ+n'FjU.O)  ]  Stained  from 


.-1, 


equation  (B.17)  in  (B.6).  So  F-^t.O),  P^t.O),  n2(T,t,0),  F2(t,0)  in  (B.6) 

replace  Fs+FfA22(A21-B21Mlg-B22M2g)Pg(t) ,  nf(t,t,0),  Ff(t,0)  respectively 

in  (D.4). 


Comparing  equation  (B.4)  with  equation  (D.2)  ,  we  see  that  by  using 

the  identity 


R12(B12Kl(t’0)+B22K2(t’0))  "  M2s  +  R22B22K2  (t’0)A22(A2rB21Mls"B22M2s) 


,-l. 


>-1, 


and  by  substituting  for  the  value  of  P^Ct.O)  from  (B.17)  in  equation  (B.4), 
the  two  equations  will  be  equivalent,  vhere  P^(t,0),  F^(t,0),  n2(r,t,0), 

F2(t,0),  P4(t,0)  replace  Pg(t),  Fs+FfA22^A2l"B21Mls~B,72M2s)  ’ 

Ff(t,0),  and  K?  (t,0)  respectively, 
t 

Equations  (D.l)  and  (B.  13)  will  be  equivalent  after  using  some  of 
the  previous  identities,  and  we  will  have  d  n. (t,s,0)  and  d  n.,(t,s,0)  replace 

S  1  S  4- 

dg’ig(t,s)  and  dgn^(t,s,0)  respectively. 

If  we  substitute  for  P^(t,0)  using  (B.17),  and  P4(t,0)  using  (B.1S) 
in  equation  (B.15)  ,  we  notice  that  equation  (B.15)  is  equivalent  to  equation 
(D.3)  where  PL<t,0),  ^(s.t.O),  ~2(s,t,0)  replace  Ps(t), 

Fs+FfA*^(A21-B21M  -B,,M2g) ,  ns(s,t)  and  nf(s,t,0)  respectively. 
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If  ng(t,t)  also  satisfies  equation  (3.12),  which  is  equivalent  to 
equation  (B.16)  ,  then  uhder  the  uniqueness  assumption  given  in  the  lemma,  we 
have 

n^s.t.O)  -  ng(s,t) 

P.  (t,0)  -  P At) 
l  s 

dgn1(t,s,0)  =  dgns(t,s). 

Before  stating  the  main  theorem,  we  need  the  following  definition  and  one 
more  lemma. 

Definition:  A  function  f(t,y)  is  said  to  be  well  behaved  in  u,  if  there 
exists  an  integrable  function  g(t)  such  that  for  all  u,  f(t,u)<g(t)  for 
almost  all  t. 

Let  <p  (t,t  ,y)  be  the  state  transition  matrix  which  satisfies 
o  o 

^0(t,tQ,'4)  *  A(t)®o(t,to,y) ;  $(to,tQ,y)  =  I 

where 


> 

H* 

H* 

rt 

' — ^ 

A12(c) 

A(t)  “ 

A21(t) 

A2,(t) 

L  u 

4 

and 


B 
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Lemma  3.4-  If  A00(t)  is  asymptotically  stable  Vt6[t  ,t4],  then  lim  4>(t,t  ,u)B(u) 

°  r  u+0  0 

is  bounded  for  all  t>t  . 

o 

Proof :  See  Appendix  E: 

Theorem:  If  the  assumptions  of  Lemmas  3.2and3.3are  satisfied,  and 

d  n,  (t,s,u),  d  n~(t,s,u),  u-(t,p),  u,,(t,y)  are  well  behaved  in  u,  and  if  the 
si  s  /  1  L 

leader  and  the  follower  choose  u,  ,  u„  as  their  controls,  where 

lap  2ap 

t  t 

u.  (t)  *  /  d  n  (t,s)x(s)  +  t  d  n£(t,s,0)z(s) 

X3.p  ^  S  S  t  S  ^ 

o  O 


B  1 

u2ap(t)  '  'R12[B12 


Kg(t)  0  x 

yK’(t)  ulL  (t,0)  z 
m  it  J  *- 


1)  lio(u.(t,p)-u1  (t,u))  »  0 
u-*0  c‘i' 


Vt€[to,tf] 


lim(u2(t,u)-u2a  (t,u))  -  0 
u-K)  ? 


Vcslto-t£) 


2)  lira(v(t,u)-y  (t,u))  -  0 
a-0  ap 


vts[to,cf] 


3)  lim(J. (a)-J.(u))  =  0 

„-*o  1  iap 


for  i=l,2 


where  y(t,u)  and  J^(p)  for  i*l,2  are  the  resulting  trajectory  and  objective 
functions  from  applying  u.(t,u),  u.(t,u),  while  y  (t,y)  and  J.  (p)  for 

1.  Z  3.  p  i&p 

i3l,2  are  the  resulting  trajectory  and  objective  functions  from  applying 
Ulap(C,")’  u2ap(C,u)' 


1.  For  the  follower  (player  2),  it  was  proved  before  that 


lim(u,(t,-)-u^  (t,-))  =  0 

--0  -  -ap 
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For  the  leader 


lim(u1(t,y)-u1  (t,y))  -  11m  /  (d  n, (t,s,y)-d  n  (t,s))x(s) 

U-0  1  lap  n^Ot  81  s  s 

o 

t 

+  lim  /  (d  n,(t,s,u)-d  nf(t,s,0))z(s) 

P-K)  t  S  L  S  r 

o 


since 


d  n,  (t,s  ,p)  -*■  d  n  ( t , s )  ,  pointwise 
Si.  s  s 


d  n-(t,s  ,y) -►  d  nf(t,s,0),  pointwise 

S  4  St 


and  since  lira  x(s)  and  lim  z(s)  are  bounded,  because  of  the  stability 
U-HD  p-*0 

assumption  in  Lemma  3.2,  and  since  d  n.(t,s,p),  d  n7(t,s,u)  are  well 

S  x  S  Z 

behaved  in  u,  then  by  the  Lebesgue  Convergence  Theorem  (LCT) ,  we  get 


lim(u  (t,p)-u  (t,u))  »  0 

u-*0  L  iap 


vte[to,tf] 


Let  y 


‘,1  Tui' 

Lz  Lu2 J 


y(t,4)  *  i(t,to,u)y(0)  +  /  4>(t,T,p)B(u)u(7,vi)dT 

Co 

t 

yap(t,M)  “  <?(t,to,y)v(0 )  +  /  i(t,x,u)B(y)uap(T,u)d- 

Co 

t 

lim(y(t,u)-y  (t,u))  »  lira  /  $(T,t,;OB(u)  (u(t,u)-u  (t,u))dt 

u-0  ap  UH.o  t  ap 

o 

since  i(t,t,u)B(u)  is  bounded  Yu  and 


lim(u(t ,u)-u  (t,u))  =  0 

n  ac 

u-0 


then  using  LCT,  we  get 


lim(y(t,u)-v  (t,u))  =  0, 


If  u^(t,u)  for  1*1,2  are  applied  to  the  system,  the  resultiong 
objective  function  of  the  leader  is 


Ji<u)  mjf  (y'Q1y  +  u^R11u1  +  u^R12u2)dt, 


If  u.  (t,u)  for  i-1,2  are  applied  to  the  system,  the  resulting  objective 
lap 

function  of  the  leader  is 


,  £ 

J,  (y)  *  tt  /  (y'  Q,  y  +u'  R.,u,  +ul  R.  ,,u-  )dt 

lap'  J  2  {  v/apyl7ap  lap  11  lap  2ap  12  2ap 


(Jl(u)'Jlap(u))  *  \  l  y’Ql(y~Vdt+i  [  yapQlCy~yap)dt 

o  o 

+  2  £  UilRll(ul”Ulap)dt  +  2  £  uiapRll(urUlap)dt 
o  o 

tf  t- 

+  i  /  u;Ru(u2-u2ap) /  u2apRl2<u2-u2ap,at' 

o  o 

Using  Che  results  of  (1)  and  (2),  and  the  well-behavedness  property, 
and  then  applying  LCT  we  get 

lim(J. (y)-J.  (u))  -  0. 

u-0  1  lap 

The  same  procedure  can  be  applied  for  the  follower’s  objective 
function,  and  we  get 


lim(Jn (u)-J^  (u) )  *  0. 

-  2ao  ■ 

u— 0 

en  claims  chat  under  proper  assumptions  if  the  leader  obtains 

s'  from  che  soiucion  of  the  fast  and  hybrid  s  low  games  respectively,  and 

to  toe  full  order  s"stem,  3nd  if  the  follower  uses  u„  ,  as 
^  *.aD 

;  i:ove,  then  the  resulting  objective  functions  of  both  plavers  are 
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If  the  fast  game  is  solved  as  before,  and  the  slow  game  is  solved 

independently  of  the  fast  information,  i.e.,  the  leader  applies  u.. 

-  ls 
ii.  »J  d  T]  (t,s)x  (s),  then  the  sufficient  conditions  for  the  existence  of 

.LS  ^  S  S  S 

the  reduced  order  LF  team  strategies  may  be  satisfied,  while  these  conditions 
are  not  satisfied  for  the  full  order  LF  team  strategies  (as  will  be  shown 
in  the  next  section).  This  makes  such  a  procedure  undesirable.  But  if  we 
can  show  that  the  sufficient  conditions  for  existence  of  the  full  order 
and  the  reduced  order  LF  team  solutions  are  satisfied,  then  this  procedure, 
in  which  the  fast  and  the  slow  information  are  decoupled,  is  well-posed. 

This  procedure  is  exactly  equivalent  to  the  one  used  in  control  problems. 

The  basic  feature  of  our  procedure,  which  depends  on  the  preservation  of  the 
information  structure  of  ''.he  full  order  game,  is  that  the  conditions  of  the 
reduced  order  games  are  equivalent,  under  certain  assumptions,  to  the  full 
order  ones,  in  the  limit  as  the  small  parameter  tends  to  zero.  This 
feature  makes  our  procedure  more  general  and  more  desirable  than  the  other 
one . 


3.5.  The  Case  of  a  LF  Team  Strategy  with  Finite-Dimensional  Memory 

In  the  previous  sections  we  have  assumed  that  the  leader  uses 
a  strategy  which  is  described  by  a  Lebesgue-Stieltj es  measure.  This 
general  strategy  provides  the  leader  with  much  flexibility  in  enforcing 
his  team  solution.  The  sufficient  conditions  under  which  the  leader  can 
enforce  his  team  solution,  by  using  such  a  strategy,  are  described  by 
intgro-dif ferential  equations  which  are  very  difficult  to  solve,  hence  such 
forms  of  strategies  are  unappealing  in  applications.  Fortunately,  it  is 
possible  to  greatly  decrease  the  mathematical  complication  of  the  sufficient 
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conditions,  by  allowing  the  leader  to  adopt  a  slightly  less  flexible  and 
less  general  strategy.  In  particular,  he  can  choose  the  following  represen¬ 
tation  of  his  team  strategy: 

Y ^(t,x(s),s  *  t)  ■  -R^BjKy(t)  +P'(y(t)  - y(t) )  +T* (W(t)  -W(t)) 

(3.13) 

where  W(t)  is  an  n-dimensional  vector  function  which  satisfies 

W -AW+Cy  +  B^  +  D^  W(0)»0 

and  W  is  the  solution  of  the  above  equation  with  y,  u^,  and  are  replaced 
by  their  optimal  team  forms.  The  leader  has  the  freedom  to  choose  the 
matrix  functions  A,  C,  B,  D,  P  and  T  which  gives  him  a  high  degree  of 
flexibility. 

If  the  leader  announces  the  strategy  (3.13),  then  the  follower's 
reaction  to  that  can  easily  be  found,  and  the  sufficient  conditions  for  the 
team  solution  to  exist  can  be  obtained,  by  basically  following  the  same 
procedure  described  in  section  (3.2).  Sufficient  conditions  for  existence  of 
the  LF  team  solution  are  given  in  the  following  proposition,  which  we  state 
here  without  proof. 

Proposition  3.1:  If  there  exist  appropriate  dimensional  matrix  functions 
A(*)»  C( • ) i  3( • ) ,  D ( • ) j  P(-)  and  !(•)  so  that  the  following  holds: 

r'^CB^M  +  D'N)  =  R^232K 

where  K  satisfies  equation  (3.1),  M  and  N  satisfy  the  following  matrix 


equations : 
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M  +  MF  +  F'H  +  Q2  +  PB[M+KB2  rUB2M  ■  PR21RUBiK  +  “lRiiR21RUBiK 
+  (C  +  BP'  -  BR‘*BjK) 'N  -  0  M(tf)  ■  0 

N  +  NF  -  T'R21B[K  +  T'B|M+ (A  +  BT') ’N  -0  N(tf)  -0 

and  F  -  A  -  BjR^BjK  - 


(3.14) 

(3.15) 


ux-  CP’  -  R^BjK)y(t)  -  P'y(t)  +  T '  (W-W) 
u2  * -Ri2B2Kt 


are  optimal  LF  equilibrium  pair.  Furthermore,  they  are  also  team  solution. 

We  assume  that  the  auxiliary  system  (the  memory  system)  employed 


by  the  leader  is  also  singularly  perturbed,  with 
n„ 


V 

w  ■  W„  ’ 


where  w^tR  and 


w2€R  .  Thus,  the  matrix  functions  A,  B,  C  and  D  take  the  following  forms: 


r^u 

J12  j 

1 

1 

n| 

•-* 

»-* 

C12 

s 

_  > 

B  a  i  _  j  , 

C  = 

A21 

A22 

!  h : 

fn 

CM 
CM 1 

u 

J 

L  -  j 

j  u_l  ! 
L  ^  J 


3.3.1.  Reduced  order  Games 

The  method,  previously  described,  to  obtain  reduced  order  and 


near  optimal  LF  strategies,  with  the  leader  using  a  strategy  wich  an 
infinite  dimensional  memory  can,  obviously  be  appLied  to  the  special  case 
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when  the  leader  adopts  scracegi.es  with  only  finite  dimensional  memory. 

In  this  section,  we  will  use  Chis  method  to  obtain  reduced  order  strategies 
for  the  case  when  a  finite  dimensional  memory  is  employed.  We  will 
restrict  ourselves  to  a  brief  description  of  the  procedure  and  the  results, 
since  a  detailed  analysis  has  been  given  in  the  previous  sections. 

In  the  fast  LF  game,  the  leader  designs  his  fast  strategy  as: 


ulf  * 'RllB12Klf  +  Pf (zf “zf ^  +  Tf (W2f “W2f ^ 


where  w^  satisfies: 

^W2f  "^22W2f +^22Zf +  R2ulf +°2U2f 

Rlf’  Zf  are  as  <*escri*,ec*  before  and  the  fast  subsystem  and  the  fast  part  of 

the  objective  function  are  given  in  section  2.3.  Sufficient  conditions  for 

the  existence  of  the  fast  LF  team  strategies  are  equivalent  to  the  ones 

B22  ^2  Ff  ^2 

given  in  proposition  3.1,  with  *klf,  ^lf,  pNf,  ,  -j-,  Q03,  Pf,  Tf, 

^22  ^22  .  —  —  —  10  —  - 

>  ^  i  and  w replaces  K,  M,  N,  B2  >  D2  *  ^2J  ^2J  P*  ^  >  y  ^nd 

w,  respectively  in  those  equations,  where 

Ff  ■  (A,2  ~  B12RllBl2Klf  "  B22R12B2Klf) • 

In  the  hybrid  slow  game,  the  leader  picks  the  following  representation  of 
his  optimal  slow  team  strategy: 


“u  ■  -Muxu  ^<VV  +?£<S-23) 


+  T 1  (w„  -w  ) 
z  2s  2s 


(3.16) 


This  specific  representation  of  the  Leader's  optimal  slow  te3m  strategy 
has  the  same  information  structure  in  the  limit  as  -  tends  to  zero,  as  che 
full  order  strategy  of  che  leader  which  is  described  by  equation  (3.13). 
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To  preserve  Che  information  structure  of  the  full  order  game  while  solving 
the  hybrid  slow  one,  we  need  to  transfer  the  fast  information  to  the  slow 
game.  As  before,  the  slow  subsystem  is: 


AU*s  +  BUuls  +  B12u2s 


And,  equivalently  the  slow  memory  subsystem  can  be  described  as: 

w,  »£,.w  +T!11x  +  B.u,  +  D,  u„ 

Is  11  Is  11  s  1  Is  Is  2s 

where 

£-Au  -A12A22A21,  cu -CU  “Cl2A22A21  -I12A22C22  +A12A22C22A22A21 


L.mB.I  —  Af  nA.nB.  4“  A,  nA.nCnnAn.£ 


B’*B1  'C12A22B21  "A12A22B2 +A12A22C22A22B21 
°1  “°1  '^12A22B22  '  A12A22B2  +Al2A22^22A22B2 


By  substituting  for  Z  ,  Z  .  w,  and  w_  ,  equation  (3.16)  can  be  rearranged 

s  s  z  s  z  s 


uls  ■-M1SX3+PS<VV  +  Is<“ls'“ls)  "Lu2s 


(3.17) 


where  the  fast  gains  are  imbeded  in  M^g,  P  ,  Tgf  L  and  M.  Now,  if  the 
leader  announces  the  strategy  (3.17),  then  the  follower  reaction  can  be 
easily  determined,  and  the  sufficient  conditions  for  the  existence  of  the 
hybrid  slow  LF  team  problem  can  be  derived.  These  conditions  are  given  in 
the  following  proposition: 

Proposition  3.2:  If  there  exist  appropriate  dimensional  matrix  functions 
zl  — 

All(')’  CI1(‘)’  Dls  (•)>  Ps(»)  and  T_(*)»  so  that  the  following  holds 

(R22+L  ^21  ”  **  ^22 ^22"ls"r^  B22^  ”  ^23‘^ls  ”  ^23*'* 

-  L'B'M  -B'  M  -l'B'N  +  D,N  )  =M„ 

Ils  X2s  is  Is  2s 


where  M  and  N  satisfy  the  following  matrix  equations 
s  s 

Ss  +Ms?s  +521  -  +«22(-«U  +“2>  -»>  +  <-5is  + 

+  <P3  -5i3>S2l<-5U+“2s  -»>  '  <PS  -MisKQ;3M2s>  +  (Ps  -M{s)B{1+A-l)Ma 


+  (=ll  +  <fs-«is)W° 


(3.18) 


i3+N3F3+^2+isS2l(-Mis+iil2s -5) 


+  (IsBl+Ai1)Ns-0 


(3.19) 


—  A  A 


r.-‘u-,ll1l.-'lA 


“is  ■  -MUXS  +  P;<XS  •  V  +T3(V13  -”ls>  '  LU23 

“2a  ■  'Vs 

are  the  equilibrium  LF  hybrid  slow  team  solution. 

If  the  sufficient  conditions  for  existence  of  the  solutions  of 
both  the  hybrid  slow  and  fast  LF  team  subgames  are  satisfied,  then  the 
leader  can  form  the  following  reduced  order  strategy  and  apply  it  to  the 
full  order  system. 


“lap  •  V  +Pf(Z  -  *3  •  *£>  +I3(U1  'V  +Tf<“2  -U23  -“2f> 

where  (the  composite  control  which  is  defined  in  section  3.4),  ?g, 

T  ,  T, ,  x  ,  z  ,  w,  ,  w_  ,  z,  and  w_  -  are  obtained  from  the  solutions  of  the 
s  t  s  s  is’  2s  t  2 t 

reduced  order  games.  Similarly,  the  follower  will  apply  u,  = u„  ,  to  the 

.ap  —  C 

full  order  system. 

Under  conditions  which  are  equivalent  to  the  ones  given  in  the 
Theorem,  the  low  order  strategies  u^  and  u-)ap  are  well-posed  in  the  sense 
that  they  tend  to  the  opcimal  strategies  and  costs  respectively,  in  the 
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limit. as  r  tends  to  zero.  The  infinite  horizon  version  of  the  problem  can 
be  easily  produced  by  setting  the  derivative  term  in  equations  (3.14), 

(3.15),  (3.18)  and  (3.19)  equal  to  zero  and  adding  the  proper  stabilizability 
conditions.  It  is  clear  that  our  procedure  can  oe  readily  applied  to 
infinite  horizon  games.  Before  we  conclude  this  chapter,  we  present  an 
example  which  shows  that  the  sufficient  conditions  for  the  existence  of 
the  team  LF  slow  solution  may  be  satisfied,  while  these  conditions  are  not 
satisfied  for  the  full  order  problem. 


Example :  Let  the  system  be  described  by 


X  “  -X  +  z 

vi  ■  -z  +  -4-  u.  +  -4-  u„. 

/2  1  /2  2 

Let 

*  J  /  (x2  +  z2  +  uj  +  u2)dt 
o 

00 

•*2  *  ^  /  (x^+2z2+2u2-f-u2)dt. 

L  o  1  Z 

Assume  player  one  to  be  the  leader,  while  player  two  to  be  the  follower. 


Full  Order  Problem:  The  team  solution  is 


u  -  -  [0. 313x  +0.414z  ]  =  -0.225x  -0.293z 

1  » 2 

u„  -  -  [0.313x  +0.414z  J-  -0.225x  -0.293z 

2  /2 


and  the  leader's  cost 


\  y’CO)Sy(O) 


where  y  3 


x 

z 


and 


S 


0.450 

.-0.313 


-0.318 
uO .414 
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Let  the  leader  choose  his  strategy  to  be  of  the  form 

u^t)  -  -0.225x(t)  +  P^xCO-x  (t))-  0.293z  +  P2(z-z  (t)) 

where  (x  >z  )  is  the  optimal  trajectory  resulting  from  applying  u^Ct)  and 
u2(t) . 

If  the  leader  announces  the  above  strategy,  then  the  optimal  reaction  of  the 
follower  will  be  unique  and  a  function  of  P^  and  p^.  It  can  be  checked  that 
there  exist  no  and  by  which  the  leader  can  force  the  follower  to  play 
with  him  as  a  team.  Clearly,  if  we  use  our  procedure,  which  depends  on  pre¬ 
serving  the  information  structure  of  the  full  order  problem,  to  solve  the 
example,  we  will  find  that  the  leader  cannot  enforce  his  team  solution  by 
adopting  the  aforementioned  affine  strategies.  Instead,  let  us  solve  the 
pure  slow  (no  information  about  the  fast  game  is  used)  problem. 


The  Slow  Problem 


-x_  +  (uls  +  u2s) 


/2 


Is 


2s 


00 

jf  (xf+1-5uL  +  1-5u2s  +  UlsU2s)dt 
o 

00 

\  I  (xs+3uls  +  2u2s’+2ulsu2s)dt- 


Team  Solution: 


Is  s 

Assume  the  leader  chooses 


0 . 159x  ,  u-  *  -0 . 159x 


u,  *  -0.159x  +  ■-(:<  -x  ) 

Is  S  S  5 


3v  solving  the  problem,  we  can  see  that  if  the  leader  chooses  l=-2,  the 
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sufficient  conditions  for  existence  of  a  team  leader-follower  for  the  slow 
problem  is  satisfied.  So  without  preserving  the  information  structure,  the 
sufficient  conditions  of  the  reduced  order  problem  may  not  correspond  to 
those  of  the  full  order  problem.  o 

The  procedure  given  in  this  chapter  which  is  based  on  preservation 
of  information  structure  of  the  full  order  problem  while  solving  the  reduced 
order  ones  can  be  applied  in  general  to  all  linear  quadratic  singularly 
perturbed  games.  So,  we  expect  a  well-posed  solution  of  the  usual  reduced 
order  problem  if  the  optimal  strategies  do  not  depend  on  the  information 
structure  as  it  is  the  case  for  control  problems  and  zero-sum  games. 


NASH  AND  LEADER-FOLLOWER  STRATEGIES  WITH 
DECISION-DEPENDENT  INFORMATION  STRUCTURE 
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CHAPTER  4 

NASH  STRATEGIES  WITH  DECISION-DEPENDENT  INFORMATION 
STRUCTURE  AND  A  NEW  MODEL  OF  DUOPOLY 

4.1.  Introduction 

In  this  chapter  we  consider  Nash  games  with  a  decision-dependent 
information  structure  (DDIS)  whereby  one  player  formulates  his  strategy  as  a 
function  of  the  decision  of  the  other.  We  compare  properties  of  the  solutions 
to  those  involving  a  normal  information  structure  (NIS)  whereby  the  strategy 
of  each  player  is  not  a  function  of  the  decision  of  the  other.  In  Section 

4.2,  we  discuss  the  equilibrium  Nash  solution  with  two  types  of  information 
structure,  we  compare  the  Stackelberg  solution  concept  with  the  Nash  solution 
with  DDIS,  and  we  give  two  examples  which  clarify  the  basic  ideas  in  this 
section.  In  the  first  part  of  Section  4.3,  we  consider  a  general  static  market 
model  of  duopoly  and  derive  the  necessary  conditions  for  the  supply  adjustment 
controls  of  both  firms  to  be  optimal  in  the  Nash  sense  with  the  two  types  of 
information  structure.  In  the  second  part  of  Section  4.3,  we  analyze  in  detail 
the  case  of  linear  market  demand  relation  and  quadratic  cost  function,  we  obtain 
analytic  solutions  for  the  optimal  strategies  of  the  two  firms,  we  show  that 
the  profit  of  the  firm  with  DDIS  is  more  than  its  corresponding  profit  with  NIS, 
and  we  discuss  the  general  properties  in  terms  of  marginal  cost,  marginal 
revenue,  price,  and  the  consumer's  welfare.  In  Section  4.4,  we  generalize  the 
concept  of  DDIS  to  multistage  dynamic  games,  we  solve  a  two-stage  dynamic 
duopoly  with  a  linear  demand  and  quadratic  cost  function,  and  we  give  sufficient 
conditions  for  existence  of  the  Nash  equilibrium  solution  with  DDIS  for  the 
discrete  linear  quadratic  problem. 
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4.2.  Nash  Solution 

Let  rx  and  ^  be  the  spaces  of  admissible  strategies  for  players  one 
and  two  (P^  and  P^)  respectively,  with  y^€  and  7516  strateSies  ^  are 

mappings  from  the  information  space  to  the  control  action  space  of  each  player. 
The  nature  of  the  information  structure  should  be  specified  in  each  situation. 

Let  ^2.^y1’y2^  an<*  ^2^Y1,Y2^  corresPon<*in8  payoff  functions  of  the  two 

players.  A  Nash  equilibrium  assumes  that  if  one  player  maximizes  on  the  basis 
that  the  other  player's  strategy  is  known  and  it  is  at  equilibrium,  then  this 
player  will  find  his  optimal  strategy  at  equilibrium. 


Definition  :  The  rational  reaction  set  of  player  i  to  permissible  strategies 

of  player  j,  D1(y.)  is 

3  • 


°l(v 


{y*€  !’i  such  that  £  Ji^Yi,Yj^  for  a11 

and  each  y^}* 


If  the  reaction  strategy  of  player  i  against  Yj  is  unique,  then  we  can 
describe  the  reaction  set  D*(yJ  by  f^OiO  where  Y^  *  f  ) . 


■^.2.1.  Mormal  vs.  Decision-Dependent  Information  Structure  (NIS  vs.  DDIS) 

In  a  Nash  game  both  players  are  required  to  declare  their  strategies 
simultaneously  before  the  start  of  the  game,  but  the  sequence  of  their  action 
depends  on  further  rules  and  assumptions  of  the  game.  If  both  players  apply 
their  strategies  simultaneously,  the  decision  of  any  of  the  players  will  not 
be  available  to  the  other,  and  as  a  result  no  player  can  formulate  his 
strategy  as  a  function  of  the  other  player's  decision,  and  their  roie  in  the 
game  is  totally  symmetric.  In  such  a  case  we  say  that  the  information  structure 


59 


of  the  game  is  normal.  If  the  two  players^-  apply  their  strategies  sequentially, 
let  us  assume  that  P2  plays  first,  P^  has  access  to  the  decision  of  P^ ,  and 
P^  has  a  choice  of  using  this  information  in  his  strategy.  Thus  P^  may  declare 
a  strategy  which  is  a  function  of  P,'s  decision,  but  the  declarations  are 
simultaneous.  The  difference  between  normal  and  decision-dependent  information 
structure  in  Nash  games  will  be  clarified  in  the  following  example. 

Example  4.1: Let  J^(u^,u2)  an<*  ^2^U1*U2^  C^e  Pa^0^  functions  of  and  P^ 
respectively,  where 


<ul’“2) 


<VU2) 


-3u2  -  10u2  -  2u^u2 . 


Under  the  normal  information  structure  assumption,  no  player  can  formulate  his 
strategy  as  a  function  of  the  decision  of  the  other  player.  So  each  player 
will  maximize  his  objective  function  on  the  basis  that  the  strategy  of  the 
other  player  is  fixed,  yielding 

u1N  -  -3/2  and  u2N  =  -  7/6 


where  (u^.,u?^)  constitutes  the  Nash  equilibrium  solution  with  a  normal  infor¬ 
mation  structure.  The  Nash  values  of  the  payoff  functions  are  J  *  -0.472, 
J2V-  +4.087. 

If  P9  will  play  first,  and  the  value  of  u2  will  be  available  to 
when  he  applies  his  strategy,  can  announce  a  strategy  which  is  a  function 


A  two-player  Nash  game  with  decision-dependent  information  structure 
can  be  generalized  to  N-player  as  follows:  Let  us  arrange  the  N  players  in  such 
a  way  as  ?]_  makes  the  first  move,  P9  the  second  move,...,  and  P^  the  last  move. 
So  each  player  i  can  formulate  his  strategy  as  a  function  of 
decisions  for  all  i  =  2,...,N. 
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of  u?.  For  simplicity,  let  this  function  be  restricted  to  be  linear  in  u^ , 
i.e.,  u1-bur 

The  Nash  equilibrium  solution  with  DDIS  can  be  determined  as  follows 
?  maximizes  on  the  assumption  that  u2  is  fixed,  and  we  have 

3 

U1D  -  "  2  * 

?,  maximizes  on  the  assumption  that  u^  ■  bu2 ,  and  the  optimal  u^  is 

U2D  *  3+ 2b  ‘ 


It  can  be  seen  that  if  there  exists  a  real  number  b  such  that 


bu 


2D 


2 

2 


men  the  Nash  solution  with  DDIS  exists.  Substituting  for  u?^  in  the  above 
equations,  and  solving  for  b,  we  find  that  b*2.25  is  the  unique  solution. 
Thus,  there  is  a  unique  equilibrium  Nash  solution  with  DDIS,  which  is 

(2.25u2d,  -0.667). 

The  values  of  the  payoff  functions  which  correspond  to  this  equilibrium 
solution  3re  1.3602  and  J2D  a -‘■3 . 334 .  P1  has  benefited  from  his 

accitional  information  (the  decision  of  P^) ,  since  his  payoff  function  under 
IT  IS  is  greater  than  che  corresponding  payoff  function  under  NIS.  P,,  suffers 
u  little  but  the  collective  payoff,  J  -hJ0  is  improved. 


Nash  Solution  with  DDIS  vs,  Stackelbers  Solution 

The  Scackelbere  solution  assumes  chat  the  two  players  have  different 
There  is  a  leader  anc  there  is 


3  rollover.  The  leader  announces  his 


2 

strategy  first  and  as  a  result  he  can  impose  a  solution  which  is  favorable  to 
himself.  It  is  the  order  of  announcing,  not  the  order  of  action,  that  distin¬ 
guishes  the  leader  from  the  follower. 

As  was  pointed  out,  in  the  Nash  solution  with  DDIS,  both  players 
announce  their  strategies  simultaneously,  where  announces  a  strategy  which 
is  a  function  of  P2's  decision,  i*e*»  u^“g^(u2).  If  chooses  g^  to  be  the 

same  as  f^  (see  the  previous  Definition),  then  PL  is  choosing  a  strategy  which  is  the 
same  as  his  reaction  function.  It  is  not  difficult  to  see  that  the  best  strategy 

for  P2  in  the  Nash  sense  will  be  equivalent  to  the  Stackelberg  equilibrium  solution 
with  P2  as  the  leader  and  P^  as  the  follower,  and  with  NIS,  that  is,  the  leader 

plays  first.  Such  a  choice  of  g^  does  not  worsen  P2's  payoff  function  (it  may 
improve  it)  since  the  payoff  function  of  the  leader  in  the  Stackelberg  solution 
is  at  least  as  good  as  (and  possibly  better  than)  that  of  the  corresponding 

Nash  solution  with  NIS  (see  [5]  )•  But  the  value  of  *  s  payoff  function  may  be 
worse.  If  the  follower  applies  his  strategy  first,  then  the  leader  can  formulate 
a  strategy  which  is  a  function  of  the  follower's  decision.  This  enhances  the 
leader's  opportunity  to  enforce  a  team  solution.  (For  a  discussion  of  the 
Stackelberg  team  solution,  see  [26]). 

Determination  of  the  Nash  solution  with  DDIS  and  thus  the  form  of 
the  mapping  g^  which  achieves  the  best  J^,  is  not  an  easy  problem  and  it  needs 
further  investigation.  But  it  is  desirable  for  P^  to  make  use  of  the  informa¬ 
tion  available  to  him  (the  strategy  of  ?^)  by  choosing  an  appropriate  function 
,  which  increases  his  payoff  relative  to  the  corresponding  Nash  payoff  with 
NIS.  Clearly,  can  always  disregard  the  information  available  to  him  and 
choose  a  robust  strategy  (a  strategy  which  is  insensitive  to  any  decision  made 
by  the  other  player).  The  solution  in  this  case  will  be  the  Nash  solution  with  NTS. 

“The  leader  can  announce  his  strategy  first  either  due  to  the  lack  of 
information  of  the  other  player  about  the  leader's  performance  index  or  due  to 
differences  in  size  and  strength. 
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A  simple  example  which  illustrates  some  of  the  basic  properties  of 
the  solution  is  presented  below. 

Example  4.2:  The  payoff  functions  and  defined  on  R^xr\  are  assumed 

to  be  convex  and  have  contour  lines  as  shown  in  Figure  4.1.  The  reaction 
1  2 

curves  D  (u^)  and  D  (u^)  are  loci  of  tangent  points  between  the  contour  lines  and 
the  lines  of  constant  and  respectively.  Point  N,  whose  coordinates  are 
(u-^.u^)  is  the  Nash  equilibrium  solution  with  normal  information  structure. 
Point  S  is  the  Stackelberg  solution  with  as  a  leader  and  P^  as  a  follower 
under  NIS.  (g^ (u2D) *U2D)  is  the  resulting  Nash  equilibrium  solution,  with 
P.'s  strategy  depending  on  P2's  declsi°n*  This  equilibrium  point  is  evaluated 
as  follows:  P^  finds  his  reaction  set  d\u2).  P2  maximizes  J2(u^,u2)  taking 
the  declared  strategy  of  P^,  which  is  u^*g1(u2),  into  account  and  finds  u2Q 
as  the  optimal  solution.  If  D^(u2d)  *  g^(u2^) ,  as  it  is  in  Figure  4.1,  then  the 
Nash  solution  with  DDIS  structure  exists. 


4.3.  Market  Model  of  Duopoly 

Let  there  be  two  firms  which  have  access  to  the  same  potential 
buyers.  The  two  firms  share  the  production  of  a  commodity  (or  two  perfect 
substitute  commodities)  with  the  quantity  of  production  of  each  firm  as  its 
strategic  variable.  The  market  price  is  determined  by  a  special  demand  mechanism. 

In  this  demand  mechanism,  the  market  is  cleared  of  whatever  quantities  the  firms  offer. 
The  sales  are  assumed  to  be  made  on  one  occasion;  thus,  actions  for  a  sequence 
of  periods  are  ruled  out.  The  Nash  equilibrium  solution  assumes  that  if  one 
duopo List  maximizes  on  the  basis  that  the  other  duopo list's  output  is  known  and 
it  is  at  equilibrium,  then  this  duopolist  will  produce  the  equilibrium  output. 
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4.3.1.  General  Model 

Let  denote  the  output  rate  produced  by  firm  i  and  Q  denote 
the  total  marketed  quantity.  Hence. 

Q  =  ql  +  q2* 

Let  us  suppose  that  the  aggregate  demand  can  be  represented  by  a  continuous 
function  as  follows : 

p  *  f(Q) 

such  that  the  price  p  and  the  quantity  Q  vary  inversely.  It  is  reasonable  to 
assume  that  each  firm  wishes  to  maximize  its  net  profit  J^,  where 

Ji  *  eii'W 

where  h^Cq^)  is  what  it  costs  firm  i  to  produce  at  a  rate  qi>  and  it  is  assumed 
to  be  an  increasing  function  of  q^. 

4. 3. 1.1.  Nash  Equilibrium  Solution  with  NIS 

With  NIS  no  firm  can  formulate  its  strategy  as  a  function  of  the 
other  firm's  decision.  Nash  theory  postulates  that  each  firm  chooses  a  rate 
of  sale  that  maximizes  its  net  revenue  given  the  competitor's  rate  of  sale. 

The  equilibrium  Nash  solution  with  NIS  can  be  found  by  solving  the  two 


equations' 


5Ji(q1’q2) 


-fern  3h, (q, ) 

=  f (Q)  +  -*1^-  q- - r— ~  -  0  for  i-1,2. 

1  3d . 

‘I 


We  require  that  q.>0,  so  the  necessary  conditions  should  be  written 
1  JJi(q1,q,) 

as  an  inequalitv  as  follows:  — - - —  <0.  But  since  this  is  not  the  main 

jqi 

issue  of  the  paper,  we  assume  that  the  solution  obtained  is  nonnesative,  for 
simplicity . 
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It  is  clear  that  firm  i  should  choose  its  rate  of  sale  such  that  its 
marginal  cost  (MC)  equals  its  marginal  revenue  (MR).  In  addition,  the 
price  (p)  is  higher  than  the  marginal  cost  (MC)  (since  <  0) ,  a  situation 

which  is  different  from  perfect  competition  where  price  (p)  equals  marginal 
cost  (MC) . 


4. 3. 1.2.  Nash  Equilibrium  Solution  with  DDIS 

Whereas  the  two  firms  announce  their  strategies  simultaneously  as  with 

NIS,  the  amount  supplied  by  firm  two  will  be  known  when  firm  one  supplies  its 
commodity,  and  firm  one  is  going  to  use  this  information  structure  in  its  declared 
strategy.  Such  a  type  of  information  structure  can  be  realized  if  firm  one  has 
spies  or  agents,  who  inform  their  management  of  the  decisions  of  firm  two,  or  if 
firm  two  has  to  apply  its  strategy  first  (e.g.,  the  plant  of  firm  two  may  be  at  a 
distance  which  is  farther  from  the  market  than  that  of  firm  one,  so  to  compensate 
for  the  effect  of  transportation  delay,  firm  two  may  have  a  faster  production 
facility).  Due  to  the  information  structure  of  the  game,  the  strategy  space  of 
firm  two  can  be  described  as  q2*a?»  where  a,£R  ;  the  strategy  space  of  firm  one 
can  be  described  by  q^ag^(q0),  where  the  function  is  chosen  by  firm  one. 

The  procedure  to  find  the  Nash  equilibrium  solution  with  DDIS  is 
as  follows : 

(1)  Firm  one  will  maximize  its  net  profit  on  the  assumption  that  q0  =  a7,  which 
can  be  put  mathematically  as 


3^1^1,C*2^  5f(0)  (q  ) 

-  f(Q)  +  ■XgU-  q  -  ~  —  =  0. 

}qL  )Q  1 

Solving  for  q^  as  a  function  of  in  the  above  equation,  firm  one  obtains 
its  reaction  function  f^(-)  and  q^  “  f,  (q7)  • 

(2)  Firm  two  will  maximize  its  net  profit  on  the  assumption  chat  q.  (.q.,) 
and  find  q„^  as  the  optimal  output  rate  as  follows 

.-ro)  -c  ’h2(q->) 

f(0)  -  ~~r~~  ^  a., - = — —  =  o. 

jQ  -qn  2  jq0 
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In  this  case 


Jfi. 


/  1  as  it  is  for  NIS,  but 


3Q  _  ,  .  3gl(q2) 


3q. 


-  1  + 


3q* 


3gl(q2) 

It  can  be  seen  that  depending  on  the  value  of  — — -  at  the  optimal 

strategy,  we  may  have  the  marginal  cost  (MC)  either  higher,  equal,  or  lower 
than  the  price  (p) . 


(3}  if  s1(q2D) *  fx  ^q2D^’  c^en  Nash  solution  with  DDIS  exists. 


4.3.2.  A  Linear  Model  case 

In  this  section,  we  will  seek  analytic  solutions  for  the  Nash 
game  with  NIS  and  DDIS.  This  will  help  us  understand  the  difference  between 
the  two  solution  concepts  and  find  the  extra  profit  which  firm  one  may  have 
obtained  due  to  its  extra  information. 

4 

Let  Q  and  p  satisfy  an  inverse  linear  demand  relation  given  as 


0  *  d  +  d,p 
o  1 

where  dQ  is  a  positive  constant,  while  d^  is  a  negative  constant.  Let  the 
cost  function  h^(q^)  be  of  the  form 

hi(qi)  "  2  Ciqi  ^  ki 
where  k^  and  c^  are  positive  constants. 

i. 3.2.1.  Mash  Equilibrium  Solution  with  NIS 

Firm  i  maximizes  J,  over  q.  under  the  constraint  that  q.  is  given. 

^  1  1 

where  i,j*l,2  and  if j .  The  reaction  curve  of  firm  i  can  be  found  to  be 


a  . 

'  l 


a  .  . 
‘J 


See  !36]  for  a  discussion  of  a  Linear  demand  relation. 
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Solving  Che  above  equations  for  1*1,2  simultaneously,  we  find  that  the  Nash 
equilibrium  solution  with  NIS  is 


d  -c,d,d 
o  2  1  o 


*1N 


3**Cld1'2c24l  +  c1c2d1 


d  -  c,  d.d 
o  1  1  o 


*2N 


3-  2cldl"  2c2dl  +  clc2dl 


2  • 


Thus  * cl2N^  *s  a  ^eas^^e  solution  since  the  q^N  are  clearly  non-negative. 


4. 3.2.2.  Nash  Equilibrium  Solution  with  DD1S 

For  simplicity,  let  us  assume  that  the  space  of  admissible  strategies 

of  firm  one  is  linear  in  the  decisions  of  firm  two,  i.e.,  q^*bq2,  where  b£R 

qx  +  q2  -  dQ 

Firm  one  will  maximize  J,  on  the  basis  that  q„  is  fixed,  and  p* - - - , 

1  2  di 

and  it  will  find  q1Q  to  be 


qlD  ’  2  -  c  dx  "  2-  c^  q2D' 


Firm  two  will  maximize  J?  on  the  basis  that  * bq^  and  p 


%  +  q2  ~  dc 


and 


it  obtains 


M2D  2+2b-c2d1 

which  is  clearly  in  the  strategy  space  of  firm  two.  A  sufficient  condition 
for  (q^^,q9^)  to  be  an  equilibrium  Nash  pair  with  DDIS  is  that 


0q2D  *  2  -  c^  "  2-c1d1  q2D ' 


Substituting  for  q  in  the  above  equation  and  solving  for  b,  we  find  that 

*mU 
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.  -cldl  > 


Hence,  (q  )  constitutes  a  unique  equilibrium  Nash  pair  with  DDIS,  where 


'ID 


,  'Cldl  . 


*2D 


-c.d  d. 
1  o  1 


‘2D  2  * 

2  •  2cldl  -  +  clc2dl 


It  can  be  easily  shown  that  is  a  feasible  solution  (i.e.,  the  output 

rate  of  each  of  the  firms  is  non-negative) . 

The  question  which  remains  to  be  answered  is  whether  firm  one 
benefits  from  the  additional  information  it  has  available  (the  output  rate  of 
firm  two).  This  question  will  be  answered  in  the  following  proposition. 
Proposition  4.1;  In  a  market  duopoly  with  a  linear  DDIS,  linear  demand  relation 
and  quadratic  cost  functions,  the  profit  of  the  firm  possessing  the  additional 
decision  information  is  increased  compared  to  that  in  the  absence  of  information. 
Furthermore,  the  profit  of  the  other  firm  is  decreased  compared  to  its  profit 
in  NIS. 

Proof :  The  net  profit  of  firm  one,  J^,  is  as  follows 
J1  "  Pql~2  Clql'kl* 


Substituting  for  p  by 


fql  +  q2-do 


becomes 


J, 


,,  2-c^ 

2d, 


qlq2 


q.d 
1  o 


-  <c 


1* 


The  optimal  strategy  of  firm  one  in  both  NIS  and  DDIS  should  satisfy  i-~ 
reaction  curve  equation,  namely, 

d 


2  -  c.d 


11 


d. 


'1  1 
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Hence,  on  the  reaction  curve  of  firm  one,  J^,  is  as  follows 

-  2  f  2  ~  Cldl  ^ 

J1  ql^  -2d1  J  kl 

is  a  strictly  increasing  function  in  q^.  So  it  is  sufficient  to  show  that 
q^D >  q^N  to  prove  the  first  part  of  the  proposition.  But  this  is  the  case  since 

d  -c„dnd 

_ o  2  1  o _ 

qlD  *  2 

2- 2c1d1- 2c2d1  +  c1c2d^ 

which  is  obviously  greater  than  q^.  For  firm  2,  the  profit  under  NIS 
(J2N)  is 


•-C2<M 


J2N  "  q2N^  -2dx  j  ~k2  ’ 


The  profit  under  DDIS  (J2D)  is 


J2D  *  q2D 


^2D2^2N’  t^en  we  ^ave 


-c. d, d“ 

1  1  o 


,/l-c,  d,  \2 


where 


-  (2-c_d. )d"  - V 

2  1  o\  a-t-1 


a  -  2-  2c1d1-  2c2d]_  +  c^d*. 


After  some  manipulation,  we  find  that 

-2  +  c1d1  +  2c2d1  -  c^d"  >  0 

which  is  a  contradiction,  so  > J?  .  a 

Consequently,  firm  two  should  not  let  its  output  rate  of  production  be  known 


to  firm  one,  because  it  will  end  up  suffering. 


I 
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Before  concluding  this  section,  several  Important  remarks  should  be 

mentioned. 

R4. 1: In  the  special  case  when  the  two  firms  have  the  same  cost  function 
(c^»Cj*c),  the  two  firms  will  have  the  same  output  rate,  and  as  a 
result,  the  same  net  profit  of  Nash  game  with  NIS  as  the  solution  concept. 
But  it  can  be  shown  that  the  net  profit  of  firm  one  is  more  than  that  of 
firm  two  under  DDIS. 

R4.2 :  If  we  compare  the  total  output  of  the  two  firms  under  NIS  and  DDIS,  we 
find  that 


QN  ”  qlN  +  q2N 


2d  -  c. d.d  -  c0d  d 
o  1  1  o  2  1  o 

3  -  20^  -  2c2dL  +  c^d* 


and 


<D 


*1D 


+  q 


d  -  c„d.  d  -  c.  d,  d 
o  2  1  o  1  1  o 


2D 


2  -  20^  -  2c2di +  cic2dl 


2  ’ 


It  can  be  easily  checked  that  the  total  output  rate  of  the  two  firms  under 
DDIS  will  be  less  than  the  total  output  rate  under  NIS .  This  implies  that  the 
welfare^  of  the  consumers  is  worse  off  under  DDIS  conditions,  since  the 
firms  sell  less  quantity  at  higher  prices  in  these  conditions. 

R4.3:  The  reaction  curve  of  firm  i  under  NIS  can  be  expressed  as  follows 


PN^qlN’q2N^  "  CiqiN 


for  i-  1,2. 


Under  DDIS  the  reaction  curve  of  firms  one  and  two  can  be  expressed, 
respectively,  as  follows 


"’An  aoproDriate  definition  of  consumers  social  welfare  ( 0 )  is  the 

0 

consumer  s  surplus,  which  is  mathematically  given  as  W(0  )  ■  1  '  , 

•  6  N 

where  0^  and  ?  are  the  equilibrium  values  of  quantity  and  pr*ce,  respectively . 
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and 


where 


PD(qlD,q2D)  ~  ClqlD 
PDtqlD,q2D)  "  C2q2D 

b  * 


dx  qlD 


37 


-cldl  I 


for  firm  one, 


for  firm  two. 


From  the  above  equations ,  it  is  seen  that  the  price  (p)  is  higher 
than  the  marginal  cost  for  both  firms  and  for  the  two  kinds  of  informa¬ 
tion  structures.  But  for  each  firm  the  difference  between  the  price  (p) 
and  the  marginal  cost  (MC)  is  higher  for  the  solution  with  DDIS  than  for 
NIS.  Hence,  Nash  solutions  with  NIS  and  with  DDIS  do  not  satisfy  group 
rationality.^  But  the  Nash  solution  with  NIS  is  closer  to  the  "perfect 
competition"  solution  than  that  with  DDIS  since  the  difference  between  price 
(p)  and  marginal  cost  (MC)  is  less  for  each  firm.  • 

R4.4:  In  the  linear  duopoly  model  we  examined,  we  assumed  the  strategies  of 

firm  one  to  be  of  the  form  q^“bq2*  Without  this  assumption  the  DDIS  Nash 
condition  yields  a  general  class  of  nonunique  nonlinear  solutions  q^“g(q2)» 
which  have  values  of  (b^D^  anc*  3^°Pes  equal  to  (b)  at  q  =*  Consequently, 

we  will  have  a  class  of  equilibrium  Nash  solutions  which  leads  to  the  same 
payoff  functions  but  with  different  sensitivity  properties. 

R4 . 5 ;  If  the  strategy  space  of  firm  one  is  assumed  to  be  of  the  form  ^2.  *  al  +  ^lq2* 
where  a^  and  b^€R;  while  keeping  the  strategy  space  of  firm  two  unchanged, 
the  reaction  curve  of  firm  one  will  stay  as  before,  i.e. 


must  equal 


^To  satisfy  group  rationality 
the  marginal  cost  (see  [3^-]) 


which  includes  the  buyers,  the  price 
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cr. 


I 

I 


I! 

I 


qlD  "  2-c^  '  2-c^  q2D* 

But  the  reaction  curve  of  firm  two  will  be 

d  -  a. 
o  1 

q2D  "  2  +  2b x  -  c2dx  * 


It  can  be  seen  that  depending  on  the  values  of  a^  and  b^,  we  have 
different  equilibrium  Nash  solutions ,  hence  we  have  an  uncountably 
infinite  number  of  Nash  solutions  with  DDIS (for  a  more  detailed  discussion 
of  a  similar  situation,  the  reader  is  referred  to '[24]).  These 
equilibrium  Nash  strategies  are  not  implementable  since  neither  firm  knows 
which  Nash  strategies  the  other  firm  will  apply. 

Mcil:  The  price  elasticity  of  demand  at  the  Nash  equilibrium  point  with 
NIS(nN)  is 


1  ~  c^d^  ”  c2^i  cic2^i 


'N 


2-ciV 


7dl 


The  price  elasticity  of  demand  at  the  Nash  equilibrium  point  with 


DDIS  is 


1  -  Vl  -  c2dl 


It  can  be  seen  that  and  the  market  is  always  price  elastic  (an 

increase  (decrease)  ir  price  leads  to  a  reduction  (increase)  in  the  amount 

of  money  spent  on  the  commodity)  under  the  conditions  of  DDIS  but  under 

2 

NIS  conditions  it  will  be  price  elastic  if  and  only  if  d^c  c, >1. 

Fy4 . 7 :  The  duopoly  model  is  necessarily  simplistic  to  avoid  technical 

complications.  However,  it  serves  to  clarify  certain  aspects  of  the 
nature  of  duopolist  competition.  Furthermore,  this  motivates  examina¬ 
tion  of  various  forms  of  strategy  spaces  from  a  game  theoretic  viewpoint. 
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4.4.  Multistage  Dynamic  Problem 

In  this  section  we  will  ^aneralize  the  basic  idea  presented 
in  Section  4.2  to  multistage  dynamic  games.  Let  the  evolution  of  the  system  be 
described  by 

Vl  '  Wulk-u2k-W 

\  “iV 

where  x^e  X  is  the  state  of  the  game  at  stage  k,  u^eD  is  the  control  of 

ac  stage  k  for  i*l,2.  The  function  h^  is  continuously  differentiable  in 
u-j^»  u2k‘  Let  c*ie  objective  function  of  be 

Ji(“ru2)  *  sN(V\!os£(V“ik'u2k) 


where 


u.  *  (u.  «...  *  u .  ) 

1  10  S-i 


and  the  function  S*  is  continuously  differentiable  in  , u^ t u2k . 

Let  us  assume  that  at  the  start  of  each  stage  of  the  game,  P2  makes 
the  first  move  and  P^  makes  the  second  move.  So  the  information  which  may  be 
(but  not  necessarily)  available  to  player  i,  at  stage  k,  let  us  call  it  n.(k), 


n,(k)  »{xo,x1,...,xk;  ^20 . u2w,u10 . ulwJ 

nL(k)  *{VV  *’V  u10”"’u1k-l,u20 . u2k}* 

j.f  the  information  available  to  the  players  is  meraoryless  (n  (k)  =  x,  and 

2  1c 

nx(k)  -(xk,u2k),  then  ?1  and  P£  have  to  choose  u^  *  and  u2k  -  Y^Cx) 

respectively,  as  their  optimal  strategies  in  the  Nash  sense. 

As  in  the  static  case,  the  general  solution  is  extremely  difficult 
to  ootain.  But  it  we  assume  a  certain  strategy  form  for  P  and  Po!  solve  a 
control  optimization  problem  for  each  player  on  the  assumption  that  the  strateg 


form  of  the  other  player  is  given,  and  if  the  resulting  optimal  strategies  have 
the  same  form  as  our  assumed  ones,  then  we  say  that  such  a  strategy  pair  is 
a  Nash  equilibrium  solution  with  decision-dependent  information  structure.  If 
the  resulting  optimal  strategies  do  not  have  the  same  form  as  our  assumed  ones, 
then  we  try  to  find  sufficient  conditions  for  the  equality  of  the  two  forms, 
and  these  conditions  will  be  sufficient  for  the  existence  of  a  Nash  equilibrium 
solution  with  the  assumed  strategy  forms.  Before  we  give  the  sufficient 
conditions  for  existence  of  a  Nash  solution  with  DDIS ,  we  will  look  into  the 
problem  of  a  two-stage  dynamic  duopoly. 

4.4.1.  Two-Stage  Dynamic  Duopoly 

Assume  that  the  two  duopolists  face  a  demand  relation  of  the  form 

where 

Qt  =  total  quantity  demanded  in  period  t 
P  c  3  price  in  period  t 

and  d  >  0,  d,  <  0,  and  d„ >  0. 

o  i  l 

Such  a  demand  relation  is  more  realistic  than  the  static  one  since  the  quantity 

demanded  by  the  consumers  in  period  t  depends  not  only  on  current,  but  also  on 

the  buyer's  view  of  future  prices,  (see  £37]),  since  if  the  future  price 

ij  going  to  be  high,  the  consumers  will  have  high  current  demand.  Let  us  assume 

-  -  :t  two  firms  produce  the  same  commodity  (or  two  perfect  substitute  commodities), 

(1)  (2) 

~  share  the  same  market.  Let  q  ,q  "  t*ie  quantities  supplied  by 
i-. :  two  respectively,  in  period  t,  so  the  demand  equation  will  be 
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r^>  -Pq(1)-C(i) 
t  *tHt  t 

where  is  the  cost  of  producing  q^\  which  is  assumed  to  be  quadratic 

in  q^  (i.e.  *|c|^q^^).  The  present  value  of  the  total  profit  of  each 


firm  is 


I  BCr(1) 


where  6=  discount  factor.  We  assume  8*1.  Thus  the  payoff 
function  of  firm  i  can  be  expressed  as 

(PV)  -  Ip  q(i)-|c(i)q2(1). 
i  t«Q  t  t  2  t  Ht 


4. 4. 1.1.  Cournot-Nash  Solution  with  Normal  Information  Structure 


If  the  information  structure  of  both  firms  is  normal,  then  at  the 
start  of  each  stage,  both  firms  supply  the  market  with  their  product  simul¬ 
taneously,  and  as  a  result  no  one  firm  can  declare  a  strategy  which  is  a 
function  of  the  decision  of  the  other  one. 

Using  dynamic  programming  and  the  definition  of  Nash  solution  we 


find  that 


At  stage  1:  q 


li_ 

.(i)  * 


At  stage  0:  The  reaction  curve  for  firm  one  is 


j  (2) 

_ 2^2 -  +  ? 

2  1  o 


j  (1) j2 

dl~  1  d2 
1_d2C0  C1 


and  the  reaction  curve  of  firm  two  is 

j  _n(D 

(2)  o  o  ,  D 


,  .2  (2)  (2)  o  2  (2)  (2)  • 

1-d-c,  c  1-d.c  c. 

21o  L  2olJ 


(2)  2 
dl'Cl  d2 
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Solving  the  above  equations  simultaneously,  we  have 


qu>  * ^(1)c,(2)c«>  * c»> ) 

d2ca)c<i>ca,c(2).ca>ca,.cu)c(2) 

(2>  *  y-yl1^1  M2) + ^  *  ^M11^ 

“on  A U)c(l)c(2)  (2).ca)  a).c(2)  (2) 

2  o  1  o  1  ol  ol 

In  order  that  ,q^^  ’^oN^  ,C^1N^  a  ^eas^^e  Nash  solution,  we  should  have 

qoN^ ,qlN^ ,qoN^ *  and  q2N^  nonnegative*  Sufficient  conditions  for  the 
feasibility  of  the  equilibrium  Nash  solution  can  be  easily  found. 

4, 4. 1,2.  Nash  Solntion  with  DDIS 

If  the  information  structure  of  firm  one  is  decision-dependent,  then 
at  the  start  of  each  stage,  firm  two  supplies  the  market  with  its  product 
before  firm  one,  and  as  a  result  firm  one  may  declare  a  strategy  which  is  a 
function  of  the  decision  of  firm  two. 


At  stage  1:  q.. 


1L. 

.(1)  ’ 


rl 

(2)  ' 


At  stage  1  the  decision  of  any  firm  does  not  affect  the  profit  of  the  other 
firm  ac  this  stage,  so  it  is  meaningless  to  have  a  strategy  which  depends  on 
the  current  decision  of  the  other  firm,  but  we  can  have  a  strategy  which 
depends  on  the  past  decisions  of  the  other  firm. 

At  stage  0:  For  any  strategy  form  of  firm  two,  firm  one  will  respond  by  its 


reaction  curve 


j  (2)  .  ( 1 )  , 2 

d  -q  d.-c 

o  no _  ,  p  112 

.  (1)  (1)2  o  .  (1)  (1) ,2  * 

1-c  c.  d»  1-c  c,  d„ 

o  1  2  iol2J 
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Let  us  assume  that  the  space  of  admissible  strategies  of  firm  one  at  this 
stage  is  linear  in  the  current  decision  of  firm  two.  Hence  firm  one  will 
declare  a  strategy  of  the  form 


,a>  -  b,(2> 
o  .  o 


The  optimal  response  of  firm  two  will  be 

_  (2) 


P  (-d2c.(2)+d.  (1+b))  +  d  (1+b) 

O  Li.  1  0 


oD 


a«»2-c<2W2> 
o  l  1 


For  an  equilibrium  Nash  solution  with  DDIS  to  exist,  firm  one  should  have 


bq 


(2) 

oD 


d  -q(2) 

(1)  (1K2  +  o 


.(2) 


'  A  (1)  .  2 

drci  d2 
(1)  (1)  2 
1  o  C1  d2. 


If  we  substitute  for  q^Q  in  the  above  equation  and  solve  for  b,  we  find  that  b 
should  satisfy  the  following  algebraic  equation 


Ab  +  Bb  +  C 


where 


A 


(1)  (1)  2  (1)  (1)  2  (1)  2 
oco  C1  d2“dlPoCo  C1  d2  PoCl  d2 


.2  (2)  (1)  (1)2  (1)  (1)  2n  .2(2) 

?od2cl  "Co  C1  d2do"Co  C1  d2Po(dl"d2Cl  1 


oCl  d2 


,  A™ 

o  2  1 


J2c(1) 
roJ2Cl 


\ *~ /  ,  , a-  \ j. /  ,  (2)  (2)  ,2  r  (1)  ,2v  ,  , j 

P  d»c,  +  p  d„c,  +  c  c,  d„p  (d, -c,  d„)  +  d,d  c’  c 

^  1  Q  2  o  1  1  2  7  ~ 


2.  (2)  (2) 

,d  c  c, 

2  o  o  1 


Consequently,  we  may  have  a  unique  solution,  two  solutions,  or  no  solution 
which  satisfy  the  Nash  rationale  with  DDIS.  A  comparison  between  the  payoff 
functions  under  NIS  and  DDIS  can  be  made  for  firm  one,  but  due  to  manipulational 
difficulties  it  will  not  be  pursued  here. 
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It  is  our  conjecture  that  if  a  unique  equilibrium  Nash  solution  with 
DOIS  exists,  then  in  a  two-stage  dynamic  duopoly,  with  linear  DDIS,  linear 
demand  relation  and  quadratic  cost  functions,  the  profit  of  the  firm  possessing 
the  additional  decision  information  will  be  increased  compared  to  that  in  the 
absence  of  information.  We  failed  to  find  a  counterexample  to  our  guess  and 
the  following  numerical  example  is  illustrative. 


Example  4. 3: Let  the  market  demand  relation  be 


(1) .  _(2)  „  „  . 
qt  +qt  “6-2Pt  +  Pt+l» 


o*2. 

o 


Under  DDIS,  with  firm  one  declaring  a  decision-dependent  strategy,  the  Nash 
output  rate  value  is 


At 

stage 

0 •  q<2> 
noD 

-  12.057; 

U> 

qoD 

2 

3 

q (2) 

moD 

At 

stage 

i:  4* 

.  n(2)- 

qlD  ’ 

fl(2) 

qlD 

*  6 

.032. 

The  resulting  Nash  profit  of  each  firm  (PV^  is 

(PV  )  *  38.34  and  (PVQ)2  *  5.999. 


Clearly,  the  profit  of  firm  one  under  DDIS  is  more  than  its  corresponding 
profit  under  NIS. 


a 
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In  the  next  section  discrete  linear  Nash  game  with  DDIS  is  considered 
The  sufficient  condition  for  existence  of  a  solution,  on  the  basis  that  the 
space  of  admissible  strategies  of  both  players  is  linear,  will  be  given. 


4.4.2.  Discrete  LQ  Nash  Gape 


Let  us  consider  the  discrete  linear  system 

Vl  ■  Vk  +  \ulk  +  B2ku2k 


whe  re 


k  ik 

\eR  ’Uik£R 


and  u^  is  the  control  of  player  i  at  stage  k,  for  i=*l,2. 

The  objective  function  of  ?.  is 

l 

1  1  N-l 

Ji  *  2  XNQiNXN+2  ki0(xkQikXk  +  UikRik  Uik+UjkRjk  ujl 


where 


E<i1k)>0;  Rjk  •  «ik s 


If  P9  makes  the  first  move  at  every  stage  k,  and  the  information  available  to 
both  players  is  memoryless,  then  and  n^Ck)  =  (x^.u,^).  Let  the 

strategies  of  and  P ^  be  of  the  form 


ulk  =  Llk*k  +  L2ku2k 


u-t  *  F,  x,  . 
2k  <c  k 


? ,  will  assume  u,,  =  r.  x,  ,  solve  a  minimization  problem,  and  find  the  optimal 
1  2k  k  k 


u, ,  as 
1  cC 


U-,  «  "(R;^  +  B'  P,  3,.  )_1(3'  ?,  ,  .A.  )x. 
l.<  lk  lk  i,k+l  lx  lk  l,k-KL  x  a 


where  ?  ,  satisfies 

L ,  x 
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»llk  ■  I«iktft^Pi.w1ik<tB^.wV'1(1upi,wV1  (t-u 


and 


«lk  •  ^k+^V 


P  will  assume  ulk*  4^  +  L^u^,  and  solve  the  following  minimization  problem: 


minimize  J ^  such  that 


where 


and 


Vl  ‘  (Ak  +  8lkLlk)Xk+(B2k  +  BlkL2k)u2k 

J2  *  I  ^2rt+2  "!j(^2k’tk+“2k52k)“2k  +  "kMk,tk  +  \^U2k) 


<)2k  "  Q2k+LlkRlk  Llk 


;«>  .  R(2)+L'  r(2)i. 
R2k  R2k  2kRlk  L2k 


w  T '  R(2). 

\  L2kRlk  Llk‘ 


The  optimal  solution  is 

.-(2)  .  z,  -  s  s-l/o' 


u2k  “  "(R2k  +B2kP2,k+lB2k}  (B2kP2,k-flAk  +  Mk):<k 


where 


\  =  Ak  +  BlkLlk; 


2k 


B2k+BlkL2k 


and  P.  ,  satisfies 
2,k 


?2,k’ia2k+^P2,k^V(^P2,k482k+>'k)(S2;)+5k?2,k+lS2k>'i(B2kP2,k+2ik^) 


(4.2) 


P  =  0 
2,N  '2M 


T-  _  ,  r  _  /g(2)  j.3*  D  5  o  I  +  M ’ )  ,  then  sufficient 

Ir  P2  chooses  Fk=-(R2k  + 32k‘ 2 ,k+l  2k'  lB2k*  2,k+L' *  V’ 

conditions  for  existence  of  a  Mash  equilibrium  solution  with  DDIS  can  be 
stated  in  the  following  proposition. 
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fi 


Proposition  4.2:  If  there  exists  an  (m^xn)  matrix  and  an  (n»lk**2k)  matrix 
L^k  such  that 

-Llk+L2k(R2k)+B2kP2,k+lB2k)  1<B2kP2,k+l\+Mk)  *  <Elk>+BikPl ,k+lV’1(Bik  Pl,k+lV 

where  P1  ?2  fc+1  satisfies  equations  (4 . 1) ,  (4.2)  respectively,  then 


*lk  *  LlkXk  +  L2kU2k 


U2k  *  -<R2k)  +  B2kP2,k+lB2k)'1(i2kP2,k+A  +  »!;)\ 


-1/5. 


constitute  a  Nash  equilibrium  solution  with  DDIS. 


D 


i 
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CHAPTER  5 

INFORMATION  STRUCTURE, OPTIMAL  COORDINATION, AND  A  GAME  MODEL  OF  DUOPOLY 

5.1.  Introduction 

In  this  chapter  we  discuss  the  idea  of  using  DDIS  in  inducing  the 
Nash  followers  in  a  LF  game  to  behave  as  members  of  a  team  with  the  leader's 
objective  function  as  the  objective  of  the  team.  We  introduce  a  static  market 
model  of  duopoly  where  the  government  interferes  in  this  market.  We  show  that 
by  incorporating  the  decisions  of  the  Nash  duopolists  in  its  strategy,  the 
goverment  can  always  succeed  in  inducing  them  to  cooperate.  In  Section  5.2,  the 
incentive  problem  is  formulated  as  an  (n+1) -person  LF  game  with  one  leader  and  n 
Nash  followers.  The  leader's  problem  is  to  force  the  followers  to  act  in  such 
a  way  that  even  though  each  one  wants  to  optimize  his  objective  function,  they 
also  optimize  the  objective  function  of  the  leader.  In  Section  5.3,  the  objec¬ 
tive  function  of  the  leader  is  taken  to  be  a  convex  combination  of  the  objectives 
of  the  followers  and  a  quadratic  cost  function  on  the  strategy  of  the  leader. 

The  incentive  mechanise  of  the  organization  is  formulated  by  incorporating  the 
decisions  of  the  followers  in  the  strategy  of  the  leader.  3v  employing  such 
forms  of  strategies,  the  leader  can  force  the  followers  to  behave  as  members  of 
a  team,  with  their  composite  objective  function  contained  in  the  objective  func¬ 
tion  of  the  team.  In  the  first  part  of  Section  5.4,  we  consider  a  general  static 
market  model  of  duopoly  where  the  government  interfers  in  the  market  throuan 
adjusting  the  effective  income  of  the  potential  buyers  of  the  commodity.  We  show 
that  by  adopting  the  incentive  mechanism  described  in  Section  5.2,  the  government 
can  enforce  the  two  competing  firms  to  cooperate  and  achieve  the  Pareto-oct inal 
solution.  In  the  second  part  of  Section  5.4,  we  analyze  in  detail  the  case  of  a 
linear  market  demand  relation  ana  a  quadratic  cost  function.  We  obtain  analytic 


solutions  for  the  optimal  strategies  of  the  firms  and  the  government,  where 
the  optiaml  strategy  of  each  firm  maximizes  in  the  Mash  sense  its  own  payoff 
function,  and  also  maximizes  the  government's  payoff  function,  and  the  optimal 
strategy  of  the  government  maximizes  its  own  objective  function  (which  reflects 
the  welfare  of  the  two  firms)  and  forces  cooperation  between  the  two  firms. 

We  show  that  in  the  limit  as  the  unit  cost  of  applying  government  control  tends 
to  infinity,  the  enforced  cooperative  optimal  controls  and  profits  tends  to  the 
voluntary  cooperative  ones.  We  discuss  the  general  properties  in  terms  of 
marginal  cost,  price,  and  the  consumers'  welfare  in  the  context  of  this  problem. 

5.2.  Formulation  of  the  Coordination  Problem 

A  multi-person  decision  problem  is  a  team  decision  problem  if  the 
decision  makers  (DM's)  share  a  common  goal  but  they  have  different  information 
structures.  A  team  decision  problem  arises  in  an  organization  in  which  the 
objective  is  to  maximize  the  payoff  function  of  the  leader  (central  manager) . 

If  the  followers  (local  managers)  share  the  same  objective  and  they  behave  in 
such  a  way  as  to  maximize  the  leader's  payoff  function,  then  the  model  of  such 
an  organization  is  a  team  model. 

In  general,  the  followers  do  not  share  the  same  objective  with  the 
leader  but  each  has  his  own  objective  function  which  he  tries  to  maximize.  In 
such  a  case  the  decision  problem  for  the  organization  can  be  formulated  as  a 
multi-person  game.  This  multi-person  game  consists  of  the  leader  and  the 
followers.  The  leader  is  able  to  announce  his  strategy  before  the  other 
players:  hence,  he  may  impose  a  solution  which  is  favorable  to  himself.  Let 
the  leader  choose  an  overall  objective  for  the  organization  with  which  he  mav 
coordinate  the  objecti”e  functions  of  the  followers.  Mow,  if  the  leader  can 
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choose  a  strategy  which  induces  the  followers  to  behave  as  if  they  were  members 
of  a  team  with  the  le_  er's  objective  as  the  goal  of  the  team,  then  we  say 
that  an  optimal  coordination  mechanism  exists. 

The  above  discussion  can  be  formalized  as 

(1)  Let  I  *  {0,1, . . . ,n}  denote  the  set  of  DM's  where  i*0  denotes  the 
leader  and  i»l,...,n  denotes  the  followers. 

(2)  Let  lL*{u^},  with  i€  I,  denote  the  decision  space  of  DM^. 

(3)  Let  :  U-*-R,  with  i€  I,  denote  the  payoff  function  of  DM^,  which  is 

n 

defined  on  the  joint  decision  space  U*  it  U .  . 

i-0  i 

(4)  Let  y  :H-*U  denote  the  strategy  of  the  leader,  mapping  the  infor- 

o  o 

mation  space  H  to  the  decision  space  Uq. 

(5)  The  decision  problem  for  the  organization  can  be  formalized  as  an 

(n+1) -person  LF  game  with  DM  as  the  leader  and  { DM . } ?  ,  as  Mash 

o  1  1*1 

followers.  This  game  is  defined  as 

Definition  5.1:  if  there  exists  a  mapping  Ti  ;  Uq  -  U  ,  for  i*l,...,n  such  that 

for  any  u  e  U  , 
o  o 

J. (Tu  ;u  )  >  J  (Tu  |u  ;u  )  Vu.€U. 

i  oo  i  o 1  1  o  l  i 

where  Tu  | u  *  (T  u  , . . . ,T  u  u  ,T  .  u  , . . . ,T  u  ),  and  if  there  exists  a 
o  l  i  o  l-i  o’  l  it1  o  no 

u  e  U  such  that 
os  o 

J  (Tu  ;u  )  >  J  (Tu  ,u  ) 
o  os  os  o  o  o 

then  the  strategies  (u  ,u.  , ...,u  ),  where  Tu  =  (u,  ),  are  called 

os  Is  ns  os  Is  ns 

IF  strategies  with  DM  as  the  leader  and  DM.  for  i=l,...,n  as  Xash 


rollovers 
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The  leader's  problem  is  to  design  his  strategy  so  as  to  induce  the 
followers  to  optimize  the  leader's  objective  function  while  playing  optimally 
with  respect  to  their  individual  objective  functions  as  well. 

5.3.  The  Coordination  Mechanism  to  Enforce  Pareto-Optimalitv 

Let  Ji(uQ,u1, . . .  ,un)  be  the  payoff  function  of  DM.  for  i»l,...,n; 

n  i  2 

and  let  JQ  -  J  c0uo  be  the  payoff  function  of  the  leader,  where  c  >  0, 

n 

and  This  objective  function  reflects  the  welfare  of  the  entire 

organization  since  it  comprises  a  convex  combination  of  the  payoff  functions  of 

its  members  plus  a  quadratic  cost  on  the  strategy  of  the  leader.  The  leader's 

objective  is  to  see  that  (which  is  also  the  goal  of  the  team)  is  maximized 

by  all  the  DM's.  To  have  a  well-defined  problem  let  us  assume  that  there 

exists  at  least  one  team  strategy  u  =*  (u  ,...,ut)  which  maximizes  J  over  all 

on  o 

uSU.  The  leader  will  seek  a  strategy  v  (.)  such  that  if  u  -v  (.)  is  sub- 

o  o  o 

stituted  in  then  D>L  will  find  a  unique  solution  (u^)  as  his  optimal 

i-n  the  Nash  sense,  and  this  solution  will  also  lead  to  the  maximum 

value  of  Jq.  This  can  be  mathematically  formalized:  The  leader  should  choose 

a  strategy  u^»yq(-)  such  that  when  DM_.  plays  non-cooperativelv ,  he  will  choose 

u.,T  which  satis fv 
iN 


Ji(Yo(’),UlN’”-’UiN . UnN}  -  Ji(vo('),ulX . Ui . unN} 


and 


. %*>  '  . -!;>• 


.et  the  leader  adopt  a  strategy  of  the  form 


t  n  t 

u  =  u  -  . -  ,  A.  (u.-u. ) 
o  o  i=l  i  i 
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The  leader  makes  use  of  the  information  available  to  him  (the  decisions  of 
the  followers)  by  formulating  a  strategy  which  is  a  function  of  these  decisions. 
By  choosing  such  a  representation  (the  reader  is  referred  to  [26] 
for  a  game-theoretic  interpretation  of  such  a  representation),  the  leader 
is  forcing  each  of  the  followers  to  choose  a  strategy  which  is  equivalent  to  u^ 
or  else  they  will  be  penalized.  So  if  there  exists  a  real  sequence  such 

that  the  optimal  response  of  ^M^(u^)  is  equivalent  to  u^ ,  then  we  say  that  the 
coordination  mechanism  to  enforce  Pareto-optimality  exists.  The  fact  that  the 
leader  can  enforce  cooperation  (formation  of  a  team  which  corresponds  to  co¬ 
operation)  among  the  competing  followers  has  deep  economic  implications, 
especially  in  a  market  structure. 

For  cooperation  to  replace  competition  among  several  firms  in  a  market 
(with  no  leader  to  enforce  Pareto-optimality),  P'>:  only  must  the  total  maximum 
profit  exceed  the  combined  competitive  return,  but  also  none  of  the  participants 
in  the  cooperative  group  must  be  able  to  achieve  higher  profit  by  means  of  some 
feasible  strategy  while  all  others  stick  to  the  cooperative  agreement. 

In  the  case  when  competition  dominates  cooperation  and  the  firms 
play  nonef f iciently,  then  the  leader  (e.g.  the  government)  may  interfer  to 
enforce  cooperation.  If  the  total  maximum  profit  obtained  after  taking  inco 
account  the  cost  of  enforcing  the  cooperation  still  exceeds  the  combined 
competitive  return,  then  the  leader  has  an  incentive  to  interfere  and  enforce 
cooperation,  as  will  be  shown  in  the  next  section. 
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5.4.  A  Market  Model  of  Duopoly  and  Enforcement  of  Pareto-Optimalitv 

Let  there  be  two  firms  which  share  the  production  of  a  commodity 
(or  two  undifferentiated  commodities)  and  have  access  to  the  same  potential 
buyers.  The  two  firms  use  the  quantity  of  production  as  their  strategic 
variable.  The  demand  for  this  commodity  depends  on  the  market  price  and  the 
consumers'  effective  income.  We  assume  that  the  government  interferes  in  the 
market  through  the  consumers  by  giving  them  subsidies  or  applying  income  taxes, 
hence  the  strategic  variable  of  the  government  has  a  direct  effect  on  the 
income  of  the  consumers. 

As  a  coordinator  of  the  market,'*"  the  objective  function  of  the 
government  is  a  convex  combination  of  the  payoff  functions  of  the  two  firms 
and  a  quadratic  cost  function  on  its  control;  and  its  problem  is  to  design  a 
strategy  which  induces  the  two  firms  to  behave  as  members  of  a  team  with  the 
objective  function  of  the  government  as  the  objective  function  of  the  team,  while 
maximizing  their  payoff  functions  as  well. 

5.4.1.  General  Model 

Let  q  stand  for  the  output  rate  produced  by  firm  i  and  0  stand  for 
the  total  marketed  quantity.  Hence, 

Q  =  qi  +  q2- 


Let  us  suppose  that  the  aggregate  market  demand  can  be  represented  by  a  con¬ 
tinuous  function  as  follows 


"‘"The  government  may  find  it  necessarv  to  interfer  ■ 
achieve  a  more  efficient  economy,  especiailv  if  the  conne t i t 
the  participants  . 
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such  that  the  price  (p)  is  an  increasing  function  of  the  effective  income  of 
the  consumers  (I),  while  it  is  a  decreasing  function  of  the  demanded  quantity 
(Q). 

The  effective  income  of  the  consumers  (I)  can  be  decomposed  into  two 

components  I  and  i  ,  where  I  is  the  nominal  Income  of  the  consumers  which 
o  o  O 

the  government  has  no  control  over;  and  iQ  is  that  part  of  the  consumer's  income 
which  the  government  can  directly  adjust  through  taxes  or  subsidies.  Hence  iQ 
can  be  taken  to  be  the  strategic  variable  of  the  government.  Consequently, 
equation  (5.1)  can  be  reformulated  as 


Iff 

> 


p  -  f(Q,io). 


It  is  reasonable  to  assume  that  each  firm  wants  to  maximize  its  net  profit 
where 

Ji  * 

^i^i)  is  an  increasing  function  in  q^  and  represents  what  it  costs  firm  i  to 
produce  at  a  rate  q^. 

The  objective  of  the  government  as  the  coordinator  of  the  market  is 
to  find  a  joint  strategy  (^t  ;q2t  ;iot^  that  ®axiniz*s  J0(ql’q2’V  where 


1  2 

Jo(ql’q2’V  "  alJl  +  ct2‘J2  ”  2  Coio 


i .  >  0 ,  c  >  0 ,  and  a,  +  =  1 . 

1  O  1  4 


The  optimal  joint  strategy  (q^t ;q?t ;i  )  should  satisfy  the  following 

for  i,j*l,2  and  i^j  (5.2) 


5  f  >o  .  5h1(qi) 

5Q  3q,  (aiqi  +  ajqj}  +  V  "ai  iq.  ° 


and 


SSH1 
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3f/3i 

iQ  •“c^(Vl  +  a2q2)*  <5-3> 

o 

To  have  a  well-defined  problem,  we  are  going  to  assume  that  there  exists  at 
least  one  joint  feasible  strategy  (by  feasible  strategy  we  mean  one  with  non- 
negative  output  rates)  which  achieves  the  maximum  of  JQ.  Hence,  we  can  see  from 
equation  (5.3)  that  the  optimal  strategy  for  the  government  (iQt)  is  to  give 
subsidies  for  the  consumers  and  consequently  increase  their  effective  income. 

The  increase  of  the  effective  income  of  the  consumers  enables  them  to  be 
willing  to  buy  any  fixed  quantity  at  a  higher  price,  which  in  effect  increases 
tba  firms'  profits.  To  induce  firm  i  to  produce  at  an  output  rate  which  is 
equal  to  qifc  for  i-1,2,  the  government  chooses  a  decision  rule  (strategy)  of 
the  following  representation 


i 

o 


St  *  Ai(Vit)  * 


(5.4) 


where  and  are  real  constants  to  be  chosen  by  the  government  to  enforce 
Pareto-optimality .  Clearly,  if  the  output  rate  of  firm  i  equals  q^fc  for  i*l,2, 
then  iQ *  i  in  equation  (5.4),  that  is  the  strategy  of  the  government  will  be 
optimal,  and  the  Indirect  cooperation  between  the  two  firms  will  also  be 
achieved . 

If  1Q  as  given  in  equation  (5.4)  is  substituted  in  the  payoff  function 
of  firm  i,  then  firm  i  will  find  its  optimal  output  rate  by  maximizing  its 
payoff  function  on  the  basis  that  the  output  rate  of  the  other  duopolist  is 
given.  The  necessary  conditions  for  the  above  maximization  problem  can  be 
mathematically  described  as 


r  r* 


3Ji  r  3£  J2l  . *  _il_  ,  8hi*ql* 


V3Q  3q±  3io  3qt 


for  1*1,2 


(5.5) 


where 


ja.- 1 

3q, 


K"V 


The  leader's  problem  is  to  choose  the  real  numbers  and  A^,  which  will 
cause  the  optimal  response  of  firm  1  to  be  equal  to  q^.  If  A^  Is  chosen  to  be 


3hi(qi)  £  (o  i  )  a  JL£ 

3qt  £(Q,io)"qi3Q 


(5.6) 


M  9^o  f  Vqlt 

io’1ot 

Vq2t 

then  the  optimal  output  rate  In  the  Nash  sense  of  firm  1  will  equal  qifc.  We 
conclude  this  section  by  the  following  proposition,  which  we  have  just  proved. 
Proposition  5.1:  In  a  market  duopoly ,  by  declaring  a  strategy  of  the  form  given 
in  equation  (5.4) ,  with  A^  satisfying  equation  (5.6) ,  the  government  can  force  the 
two  firms  to  choose  a  Pareto-optimal  output  rate,  while  maximizing  their 
corresponding  profit  as  well. 

5.4.2.  A  Linear  Model  Case 

In  this  section,  we  will  determine  analytic  solutions  for  the 
coordination  problem  presented  in  the  last  section.  This  will  aid  in  under¬ 
standing  the  economic  implication  of  the  coordination  problem. 

Let  the  market  demand  relation  satisfy  the  following  linear  equation 
(for  a  discussion  of  the  linear  demand  relation,  the  reader  is  referred  to  [36] 


Q  ■  d  +d.p+d,i 

‘  n  I  r  7  r 


where  dQ  and  d^  are  positive  constants,  while  d^  is  a  negative  constant. 

Let  the  cost  function  h^Cq^  be  of  the  form 

hi(qi)  -  \  C ^ 

where  and  are  positive  constants.  For  the  linear  model  case,  we  have 
the  following  corollary. 

Corollary  1:  In  a  market  duopoly  with  a  linear  demand  relation  and  a  quadratic 
cost  function,  the  government  can  force  cooperation  between  the  two  firms  by 


choosing  to  be 


,’it(2-cidl)+,1Jt-do  +  ‘121c 


Vic 


for  i,j-l,2  and  i?*j 


where 


,  bHdt 
'ic  ;  J  ,-l 

VVj 


dot  '  TZ-  ‘Vic  +  VW 

1  O  . 


,2 

Vid2 

b  »  -1 - -  J'  4 

«  dlc0 


s.  -da. 
i  o  i 


a.  ■  2a.  -  a.d.c,  +  -r -  . 

i  i  i  1  i  d,c 

1  o 

Proof :  The  proof  is  straightforward  and  follows  the  same  lines  given  in  the 

discussion  of  the  general  case.  a 

Instead  of  absorbing  the  total  cost  of  cooperative  enforcement,  the 

government  may  charge  the  firms  the  cost  of  policing  the  cooperation.  Let  the  cost 

1-2  _  _ 

charged  to  firm  i  be  r  c.i  ,  where  a.c.  +x,c,  *c  .  The  objective  function  of 

£  L  O  1  1  -  L  O 


the  government  (JQ)  will  be 


a,J,  +,;t2^2’  wbere  the  profit  of  firm  i  (J^)  is 

1  2  1  -  .2 

'  2  Ciqi"  2  Vo’ 


It  can  be  shown  that  the  government  can  £orce  cooperation  between  the  two 
firms  by  choosing  of  a  slightly  different  form. 


W^iV+'Mt-Wot 


Vlt  +  2Jl1ot 


(5.7) 


where  q^,  q  ,  and  iQt  are  as  previously  described. 

In  the  remainder  of  the  chapter  we  consider  the  Important 

special  case  when  the  objective  function  of  the  government  (J  )  is  the  total 

o 

profit  of  the  two  firms  and  a  quadratic  cost  on  the  strategy  of  the  government, 
1  2 

i.e.,  J  ■  c  i  .  By  the  same  techniques  as  above,  we  find  that 

O  X  Z  L  O  O 


-d  d  c,c 
10  1o 


c0[dlclc2“2(cl+c2)dl1’(cl+c2)d2 


(5.8) 


d2do(cl+c2) _ 

c0tdlclc2_2(cl+c2)dl 1 - <ci+c2) d2 


(5.9) 


Co(cldl)  +  2d2(ci'H:1) 
-dld2CJCo 


(5.10) 


q.t,  i  .  and  A.  are  sketched  as  a  function  of  the  parameter  c  in  Fig.  5.1, 
it  ot  1  O’ 

Fig.  5.2,  and  Fig.  5.3,  respectively ,  where  cq  denotes  the  cost  of  applying  one 
unit  of  government  control. 

From  Fig.  5.1  it  can  be  seen  that  the  optimal  output  rate  of  the  firms 
are  feasible  (the  q^'s  are  non-negative)  if  and  only  if 


c  (WJ2 

°  ^cic2~  2(c^+Cj)d^ 


Hence,  the  optimal  feasible  strategy  of  the  government  amounts  to  increasing  the 
effective  income  of  the  buyers  as  shown  in  Fig.  5.2. 


igure  5.2.  The  optimal  government  control  vs.  its  one  unit  cost 
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5. 4 .2.1.  Forced  Cooperation  vs  Voluntary  Cooperation 

If  the  two  firms  agree  to  cooperate  with  Jj+J2  M  their  objective 
function  (they  agree  to  form  a  cartel,  where  the  members  retain  their  separate 
identities  and  separate  control  over  their  policies) ,  then  the  optimal  output 
of  the  two  members  of  the  group  can  be  found  by  solving  a  straightforward 
maximization  problem  of  the  total  profit  of  the  two  firms  on  the  basis  of  the 
demand  conditions.  This  optimal  output  rate  is 


-c.d.d 

_ J_L_2_ 


1  -2c1d1-2c2d1  +  djc1c2 


for  i,j»l,2  and  i^j 


and  j”  is  the  optimal  total  profit.  The  trouble  with  this  solution  is  that  it 
may  not  be  practically  attained,  since  any  firm  can  depart  from  this  solution 
without  being  penalized  for  increasidg  its  profits.  For  example,  if  firm  1 
supposes  that  firm  2  is  going  to  stick  to  the  cooperative  solution 

,  firm  1  chooses  its  optimal  output  rate  as  the  one 

d 


-clVo 

~2cI1r2c2dL  *  dlclC2 


which  satisfies  its  rational  reaction  relation  q  *  •= — —r — x  •  ~r  q,,.  Then 

2  1  2"Cldl  2"Cldl  2 

d  (-c  d  -2c  d.+d^c  c  ) 

we  have  q.  *  - - - - - = -  .  Hence,  by  departing  from  the 

(2-c^)  (-2c1d1-2c2d1+d^c1c2) 

cooperative  solution,  firm  1  managed  to  increase  its  profits,  of  course,  the 
same  argument  applies  to  firm  2.  As  a  result  no  firm  may  be  willing  to  join 
the  coalition,  knowing  in  advance  that  it  and  all  others  may  depart  from 
the  cooperative  agreement. 

Earlier  in  this  chapter,  we  showed  that  the  government  can  force 
cooperation  between  the  two  firms  if  it  announces  a  strategy  of  the  form 
given  in  equation  (5 .4) ,  where  A.  satisfies  equation  (5.10).  The  optimal  output 
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race  of  Che  two  firms  and  Che  scracegic  variable  of  Che  govemmenc  depend  on 
cq.  If  cq  is  large  enough,  Chen  Che  Cocal  profic  (JQ)  of  Che  cwo  firms  under 
forced  cooperaCive  conditions,  will  become  arbicrarily  close  Co  Cheir  cocal 

profic  (J°)  under  Che  volunCary  coaliCion  condicions,  since  ic  can  be  seen  chac 

o  o 

as  c  -»■•,  q .  +  q , ,  i  +  0,  and  J  -►  J  • 
o  ’  niC  ’i  ot  oo 

For  large  enough  c  Chere  is  a  seeming  paradox,  since,  alchough  in 

o 

boch  sicuacions  of  volunCary  and  enforced  coopera cion,  Che  govemmenc  conCribuCes 
noching  or  almosC  noChing  Co  Che  market,  and  each  firm  has  Che  same  oucpuc 
race,  boch  firms  have  an  incencive  CO  deparC  from  Che  PareCo-opcimal  soluCion 
under  volunCary  cooperacive  condicions;  whereas  no  firm  would  change  iCs  soluCion 
under  enforced  cooperacive  condicions, since  ic  can  only  lose  by  deparcing  from 
Che  equilibrium  soluCion.  The  answer  Co  Chis  paradox  is  chac  Che  sCraCegy  of 

f 

Che  govemmenc  has  Cwo  differenc  represencacions  for  Che  same  value  in  chese 
cwo  cases  (iQ  -  0  for  Che  volunCary  case,  while  iQ  *  A^Cq^-q^)  +  A2(q2-q2t) 
for  Che  enforced  cooperacion) .  The  represenCaCion  of  Che  sCraCegy  in  Che  enforced 
Pareco-opcimal  case  has  a  Chreacening  power,  which  direecs  each  firm  co  behave 
in  a  certain  way  or  else  ic  will  incur  addiCional  coses. 

In  summary,  we  can  say  that  by  being  in  a  coordinating  position  in 
the  market,  the  government  forces  the  two  firms  to  cooperate,  thus  obtaining  Che 
maximal  monopolist  joint  profit,  but  with  arbitrarily  small  cost. 

The  following  numerical  example  illustrates  the  basic  ideas 
presented  in  this  section. 


5. 4. 2. 2.  Numerical  Example 

Let  the  market  demand  relation  be 


qx  +  q2  »  10  -  2p+  iQ. 


Let  us  assume  chat  Che  Cwo  firms  have  identical  cost  functions  with 

C1  ■  c2  ’  1- 

Substituting  the  values  of  dQ,  d^  d 2,  c^  c 2  in  equations  (5.7),  (5.8),  (5.9), 
we  find  that 


The  optimal  output  rate  of  each  firm  (q.  )  -  - ;r  • 

It  12c  -2 
o 

The  value  of  the  control  of  the  government  (i  )  *  j 

o 

The  strategy  of  the  government  (iQ)  is 


io  "  (6c  -1^  +  AiC(ql~qlt)+(q2*q2t)J 


where 


-c  +2 

K  -  — - —  . 
i  c 

o 


For  c  »10,  we  find  that 
o 


q ,  -  1.695;  i  -  0.1695 

it  ot 


Ax  -  A2  -  -0.8 


p  -  3.39 

J.+J,  -  8.6190;  \  c  i2  -  0.1436. 

12  2  o 


Hence, 


J  »  J.  +  J,  -4  c  i2  -  8.475. 
o  1  2  2  o 


For  c  *  1000,  we  find  that 
o  ’ 


q .  -  1.667;  i  _  =  0.0017 

it  o  t 

A,  »  -0.998,  p  *  3.333;  ^  c  i2  =  0.0014 

l  Zoo 


J.+J-  -  8.333  and  J  =8.332. 
12  o 
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i 


» 

» 


i. 


If  the  two  firms  agreed  to  cooperate  (without  the  interference  of  the  government), 
then  the  Pareto-optimal  solution  would  be 

q°  -  q°  -  1.667 

i°  -  0 
o 

and  J  *  J,  + «  8.333. 

o  1  2 

It  can  be  seen  from  the  above  that  for  co*10,  the  total  profits  after  taking  into 
account  the  cost  of  policing  the  cooperation  is  bigger  than  the  total  profits 
under  voluntary  conditions.  In  such  cases  the  government  may  charge  the  two 
firms  the  cost  of  its  control,  for  example  when  c,  ■  c„ *  5  with  c,  + c„ * c  ,  the 
government,  by  choosing  * -0.3998  (see  equation  (5.7)),  can  enforce  cooperation 
between  the  two  firms,  charge  them  the  cost  of  its  control,  and  still  make  the 
total  profit  of  the  two  firms  exceed  the  voluntary  monopolistic  total  profit. 

By  increasing  the  consumers'  demand  for  the  marketed  commodity,  the  government 
altered  the  market  structure  and  consequently,  increased  the  total  profit  of 
the  two  firms  compared  to  their  profit  under  volunary  cooperative  conditions. 

For  large  enough  values  of  cq  (cq  *  1000  Is  large  enough  for  our 
example) ,  the  values  of  the  optimal  strategies  and  profits  of  the  firms  and 
government  under  forced  cooperation  conditions  are  arbitrarily  close  to  the 
corresponding  values  under  voluntary  cooperation  conditions. 

The  optimal  solution  under  enforced  cooperation  conditions  is  a 
LF  equilibrium  solution  with  the  government  as  a  leader  and  the  two 
firms  as  Nash  followers.  Hence,  there  is  no  incentive  for  any  participant  to 


» 


leave  this  solution. 
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Before  concluding  this  section,  several  remarks  should  be  mentioned. 

R5.1  If  we  compare  the  output  rate  and  the  price  in  the  market  under  forced 

cooperation  and  voluntary  cooperation,  we  find  that 

-d. d  (c.+c_) 

~o  o.o  1  o  1  2 

Q  -q+q - 2 - 

-2c1d1-2c2d1  +  d^c1c2 

and  o  do(c1+c2)-dldoc1c2 

p  ae  ■"'■■■  ■' 

~2cldl~2c2dl +  dlClC2 


while 


~dldo(cl'tc2)eo _ 

Co  ["2cldl'2c2dl'Wlclc2 <C1+C2)  d2 


p  _  Cotdo(cl+C2)~dldoClC2j _ 

Co^*‘2cldl'2c2dl+dlclC2^”^Cl+C2)d2 

/  (ci+c2^d2  \ 

For  all  feasible  values  of  c  c  >(—= - )}  the  output  rate 

V  dlclc2-2(cl+c2)dl/ 

and  the  price  under  forced  cooperation  are  larger  than  the  corresponding 

output  rate  and  price  under  voluntary  cooperation.  The  welfare  of  the 

consumers  (W)  is  appropriately  defined  as  the  consumer's  surplus  which 

Q.N 

can  be  mathematically  formalized  as  WCQ^)  *  /  ( pCsJ-P^Jds,  where  and 

o  1 

are  the  equilibrium  values  of  price  and  quantity,  respectively.  It  can 
be  verified  that  W  is  worse  off  under  voluntary  cooperation, hence,  the  govern¬ 
ment  increases  the  social  welfare  of  the  consumers  compared  to  that  under 
voluntary  cooperative  conditions.  But,  in  the  limit  as  cq^»,  the  price,  the 
output  rate,  and  the  social  welfare  of  the  consumers  will  be  equal  under 
both  conditions. 

Hj .  2  The  price  elasticity  of  demand  (n)  at  both  the  voluntary  cooperative  point  and 


for  all  values  of  c  at  the  enforced  cooperative  point  is  identical  and  it  equals 

o 
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» 


L-  '  -• 


n  -  <d0(c1+c2)-d1doc1c2)/(do(c1+c2)). 

It  can  be  seen  that  the  market  is  always  price  elastic  under  both 
conditions . 

R5.3JUnder  voluntary  cooperative  conditions  the  output  rate  (q^)  satisfies 


o  qi  /,  ,  ix 

P  ‘  ciqi  "  (1  +  ^° 


for  i  ■  1,2. 


Under  forced  cooperative  conditions,  the  output  rate  (q^t)  satisfies 


P  -  c  .q  .  _ 

t  l  it 


c  c  d.+2d,(c  +c,) 
-1  ...oil  2  1  2  \ 

—  qitU+ - 2 - ' 

dl  iC  dfc.c 

1  j  o 


for  i  ■  1,2. 


R5.4 


From  the  above  equations,  it  can  be  shown  that  under  voluntary  cooperative 
conditions,  the  price  (p)  is  higher  than  the  marginal  cost  (MC)  for  both 
firms;  whereas  under  enforced  cooperation  conditions,  the  price  (p)  is 
higher  than  the  marginal  cost  (MC)  for  both  firms  if  and  only  if 


2d  i 


c  >  __ 
o  -d. 


2d2  (c1+c2)d2 

~dl  d?clV2<cl+C2)dl 


(The  feasibility  conditions  is  also  satisfied  since 
2 

-  .)  For  each  firm  the  difference  between  p  and  MC 


is  smaller  for  the  enforced  solution  than  for  the  voluntary  one;  thus  the 
former  solution  is  closer  to  the  group  rationality  solution  (to  satisfy 
group  rationality,  which  includes  the  buyers,  the  price  must  equal  marginal 
cost)  than  the  latter. 

It  can  be  noticed  from  the  previous  example  that  depending  on  cq,  the 
total  profits  of  the  two  firms  under  enforced  cooperation  conditions  maybe  larger, 
equal,  or  smaller  than  the  corresponding  profits  under  voluntary  conditions . 

The  procedure  to  find  the  region  of  cq  for  which  that  is  true,  is  straightforward 
but  lengthy.  It  will  not  be  pursued  here. 
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R5.5;  Our  derailed  analysis  was  constrained  to  the  case  where  the  objective 

function  of  the  government  (J  )  is  the  total  profit  of  the  two  firms  and 

o 

a  quadratic  cost'  on  the  strategy  of  the  government,  similar  results  are 

expected  when  the  objective  function  of  the  government  (J  )  is  the 

o 

convex  combination  of  the  profit  functions  of  the  two  firms  and  a 

1  2 

quadratic  cost  on  the  control  of  the  government,  i.e.,  J  » a, J,+a_JL  -  £  c  i  . 

o  1  1  l  2.  2  o  o 
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CHAPTER  6 

ON  SOME  STOCHASTIC  STATIC  TEAM  LEADER-FOLLOWER  PROBLEMS 

6.1.  Introduction 

In  this  chapter  we  deal  with  two  static,  stochastic  leader- 
follower  team  problems,  where  each  player  has  a  quadratic  cost  function  and 
part  of  his  Information  is  a  linear  function  of  Gaussian  random  variables . 

In  Section  6.2,  we  consider  the  problem  of  a  3-person  stochastic  optimal 
coordination,  where  the  coordinator  desires  to  induce  the  two  noncooperative 
(in  Nash  sense)  players  to  minimize  his  cost  function,  even  though  each  player 
minimizes  his  own  cost  function.  The  cost  function  of  the  coordinator  is 
a  convex  combination  of  the  noncooperative  players'  cost  functions.  The 
information  structure  of  the  game  is  nested  and  dynamic,  whereby  the 
coordinator  not  only  knows  whatever  the  other  players  know,  but  also  detect 
exactly  their  decision  variables.  We  show  that  by  incorporating  the  decisions 
of  the  other  players,  the  coordinator  can,  under  a  certain  condition, 
successfully  enforce  the  team  solution  with  his  objective  as  the  goal  of  the 
team. 

In  Section  6.3,  we  consider  a  two-person  leader-follower  game,  in 
which  the  leader  does  not  completely  detect  the  decision  variable  of  the 
follower.  To  achieve  the  best  possible  outcome  of  the  leader,  we  define 
a  new  modified  team  problem  after  taking  into  account  the  optimal  response  of 
the  undetected  action  of  the  follower.  We  find  that  the  leader  can,  under 
a  ceratin  condition,  achieve  this  new  tight  lower  bound.  Finally,  we  give 
a  numerical  example  to  illustrate  our  procedure. 


104 


i 

•v' 

■ 

*» 

m 


■ 

r 


6.2.  Stochastic  Optimal  Coordination 

Let  x  be  a  Gaussian  random  vector  over  a  probability  space 

x  :  S Rn  with  zero  mean  and  £  covariance.  Let  Uj^  denote  the  decision  variable 

“i 

of  decision  maker  (DM)^  which  takes  values  in  R  for  i- 0,1,2.  The  objec¬ 
tive  functional  of  (DM)^  for  1*1,2  is  defined  by 


G. (x,u  ,u, ,u~)  *  r  u'D.u,  +  u!q,x+u'r.u, +t  u's.u  + u  L.u, + u't.x. 
i  ’  o  1  2  2  i  i  i  ixi  i  i  j  2  oio  oii  oi 


The  objective  functional  of  DMq  (the  coordinator)  is  a  convex  combination 
of  the  objective  functionals  of  the  other  two  DM’s,  i.e. 

G  (x,u  ,u. ,u2>  -  a^G^x.u^u^.Uj)  +  a^Cx.u^.u^ ,u2> 

where  l>a^i0,  and  o^  +  a2*l.  To  describe  the  information  structure  of  the 
game,  let  y^  and  y2  be  two  random  vectors  defined  as 


y  *  H^x  +  w  for  i *  1,2 

where  IL  is  r^xn  real  constant  matrix,  and  w^ ~  N(0,A^) . 

Let  n±  denote  the  information  available  to  D>L.  We  consider  in 
this  section  the  following  information  structure 


n 

o 


yl’y2’ul’u2 

yl 


We  denote  an  admissible  strategy  of  DM_.  by  '{.  which  is  a  Borel  measurable 
mapping  from  the  information  space  into  the  decision  space,  and  we  also  denote 
the  SDace  of  admissible  strategies  of  DM.  bv  7 . . 

=  L  j. 


!» 
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Let  us  define  the  expected  value  of  G  (x,y  (n  ) ,Y, (n, ) ,Y„(n„)) 

1  o  o  11  2  2 

with  respect  to  the  random  variable  x  by 

t 

Ji^Yo'Yl'Y2^ ’  i,e*  Ji^Yo,Yl,Y2*  "  E{Gi(x’Yo(V*Yl(V’Y2(V)} 

The  coordinator's  goal  is  the  achievement  of  the  global  minimum  of  J^,  and 

his  role  is  to  adopt  a  strategy,  YQ(no).  which  induces  DM^  and  DM2  to  play 

with  him  as  a  team  with  Jq  as  the  objective  function,  even  though  there  is 

no  explicit  cooperation  between  them.  If  y  (n  )  is  substituted  in  J . ,  and  J, 

o  o  i  i 

is  minimized  over  in  the  Nash  sense,  and  if  the  resulting 

solution  leads  to  the  minimization  of  Jq ,  then  the  optimal  coordination 
problem  is  solved. 

By  Nash  solution  we  mean  that  if  y  (n  )  is  a  given 

o  o 


strategy,  then 


J1(yo’Y1,Y2^  “  J1(yo,Y1’Y2^ 
J2^Yo’y1,Y2^  *  J2^Yo,Y1'Y2^ 


6.2.1.  Derivation  of  the  Optimal  Equilibrium  Strategies 
We  start  by  finding  the  optimal  team  solution.  Let 

min  min  min  J  (ya,Y,.Y,)  *  J  (y^.Y, .Yo)  • 
y  sr  y.sr,  y0€"t 

o  o  1  12  2 


Lemma  6.1:  If 


s 

a,  L, 

o 

1  1 

«lLi 

'JiDi 

ot  R  +a  R,  i  >  0 

II  L  L  I 


Chen  che  globally  optimal  team  solution  exiscs  and  is  given  by 


e 


i  <•. 


I 
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\ S (no)  "  ■io1[alLlul  +  °2L2u2  +  V(xIyl’y2)1 


where  T0  *  a1T1  +  a2T2  and  S0"alSl  +  a2S2 


yj(ni)  -  Kiyi  for  i-1,2 


where  satisfies  the  following  matrix  equation 

+  (a1R±  +  a  Rj -a^  L^S"1^ )  Kj  Hj  (H^EH^  +  A^"1  ] . 

Proof :  The  proof  Is  straightforward  and  can  be  obtained  by  using  standard 
stochastic  control  techniques  under  the  nested  information  structure 
consideration  [48,43] 

The  optimal  team  solution  can  be  expressed  as 

Yo(yl,y2’Vu2)  '  holyl  +  h02y2  +  ho3ul  +  ho4V 


Yi  *  Ktyi 


for  i  ■  1,2. 


The  coordinator's  problem  is  to  adopt  a  certain  form  of  strategy,  in  which  he 
incorporates  the  decisions  of  DM^  and  DM2,  and  by  which  he  can  force  the  two 
DM's  to  cooperate  while  each  DM  is  minimizing  his  cost  function  as  well.  Let 
us  restrict  our  investigations  to  the  class  of  strategies  which  are  linear  in 
u^  and  u,, 

W  “  Yo(V  +  +  A2(u2_y2)  ' 

In  general  we  can  take  and  A?  as  matrix  functions  which  are  measurable 
with  respect  to  the  sigma  fields  generated  by  n  and  %  respectively.  But. 


for  simplicity  we  will  consider  only  constant  matrices.  It  is  clear  tnat  if 


VxV  A  JV.  V-  *-•- 


V  V  -  '.‘1C  -iA.  A  .t  v:  -a — A-  — 


-*  *  •  «  •  «  *  -  * ..  •-»  ■ 


Che  macrices  and  are  chosen  in  such  a  way  chat  Che  opcimal  response  of 

DM^  and  DM2*  and  U2  equal  Cheir  corresponding  team  decisions,  Chen  we  say 

chac  Che  scochascic  opcimal  coordinacion  problem  is  solved. 

To  find  Che  racional  reacCion  of  DM^  co  Che  announced  scracegy  of 

Che  coordinacor,  we  subsCiCuCe  y  (n  )  as  given  above  in  J . (y  ,u, ,u_).  Since 

00  1  o  1  z 

J. (y  (n  ) ,U. ,u_)  is  quadracic  and  if  scricc  convexicy  of  J.  on  T.  is  assumed, 
i  o  o  i  L  lx 

Chen  Che  minimum  of  wich  respecC  Co  u^  can  be  obcained  by  person-co-person 
opcimizacion  and  by  Caking  Che  gradienc  of  wich  respecc  co  u^  and  equacing  ic 
co  zero.  If  chis  procedure  is  carried  ouc  ic  follows  chac 

E[  (Diui+Qix+RiUj+(hoi+Ai)  's±  [  (hoi+AiHui-Kiy1)+(hoj+Aj )  (uj“Kj7j ) 

+hoi+2!'itto]+2!'J1  +Li(Al<ui-Kiyl)  +AJ(uj-K3:,j)  +hol"l+hoju3 

*03VVV+(Al*oi>  V,|yi>  ■  °' 


After  some  scraighcforward  manipulaCion  we  find  thac 


Yi(yi)  “  *MilvijE(xlyi)  "Nij  "°ijE(Yj(yj)lyi) 


(6.1) 


where 


Mi  *  V  (hoi+Ai)  ,Si(hoi4Ai)+Ll(Ai4iloi)  +  (Ai+hoi)  \ 

Vij  -  ^i+  ^hoi+Ai)  ,Si^o,2+j^j  +  ^ih2+j^j~^oi+Al^  'Si^oj+Aj 

-L.'a.K.H.  +  (A.+h  .)  'T. 
i  j  j  j  1  oi/  1 

N\  .  =■  (L'h„ ,.-(h  .+A. )  *S  (h  .+AJK  -L.’a.K.  +  (h  ,+A.)’S.h  ,..)v. 
ij  i  2+i  01  1  i  01  i  i  111  01  1  1  o2+i  •  1 

0  *  (R.  +  (h  ,+A.)'s.(h  +A.)+L.(A+h  .)]. 

ij  1  1  01  1  1  oj  j  1  j  oj 


That  >!  is  a  nonsingular  matrix  and  follows  from  the  strict  convexity  assumption 
of  J .  on  T  .  . 


The  following  proposition  gives  the  conditions  of  existence  and 
uniqueness  of  the  solution  of  (6.1). 

Proposition  6.1:  If  either 

or  *«:VjV  ‘ 1 


then,  there  exists  a  unique  linear  Nash  solution  (y^y^)  which  satisfies 
equation  (6.1),  for  each  pair  (A^.A^),  where  X(A)  is  the  maximum  eigenvalue  of 
any  bounded  real  symmetric  matrix  (A’A). 

Proof :  The  proof  is  equivalent  to  the  one  given  in  [49] . 

Let  Y*^)  *  in  (6.1),  then  we  have 

\  - -mI\3  v  “Aj  -  “I'vVi  t6-2> 


where 


Z±  -  IH;[(HlEH;[  +  Ai)':L. 


Equation  (6.2)  can  be  rearranged  and  written  as  follows 

Ai[llKl  +  Ti:i  +  Sl<ho,2+l+h0,2+jHjEi)1+DiKl  +  Vl  +  hoiILiKi  +  Ti:i 
+  “jV  1  +  I'i<ho2+i+h=2+jBJi:i) 


+  R.K.H.I  +  L>  .K.H,:.  -  0 

ijji  i  »jj  i  i 


or  equivalently 


Condition  6.1: 


satisfies 


a'b1  =  b* 

i  o  1 


There  exists  at  least  one  matrix  A.  for  i  =  i,2  which 

i 


A'B1  -  B1. 
i  o  1 


-  V’ 


v-. 


f: 


-  ^  •:  •  • 


i 

l\m 

i> 

§ 

.  j 


:■> 


P 


i 
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The  dimensions  of  A^,  B*,  and  B*  are  nj*m0»  mQxr^,  m^*r^,  respectively,  and 
we  have  ®^x®0  variables  to  choose  which  satisfy  m^xr^  equations.  Condition  6.1 
is  generally  guaranteed  if  mQ  >  maxCr^,^) . 

From  before,  we  can  easily  see  the  proof  of  the  following  theorem. 
Theorem  6.1;  If  Condition  6.1  is  satisfied  and  if  y  (n  ) S  T  is  picked  as 


Y_(n  ) 
o  o 


Al^Ul“Yl^  ~  A2^u2"y2^ 


then  a  solution  to  the  optimal  coordination  problem  exists 


6.3.  Linear  Quadratic  Leader-Follower  Games  with  Partial  Decision- 
Dependent  Information  Structure 

As  in  the  previous  section,  x-n-ditnens ional  random  vector  and 

x~N(0,I).  u^^  is  the  decision  variable  of  player  i  (Pj  ,  and  it  takes  values 

mi 

in  R  for  i»l,2.  Let  be  the  leader  while  is  the  follower.  The 
objective  of  the  leader  is  defined  as 


L1(x,u1,u2)  -  2  u1RHu1  +  u1R12u2  +  u1R13X+2  U2R14u2 +  u2R15X’ 


while  the  objective  function  of  the  follower  is 

1  », 


L2(X’U1»U2)  ’  u2R21X+2  U2R22U2  +  U1R23X+U1R24U2’ 


where 


R11,R22  >  °‘ 


Let  y  and  be  two  random  variables  defined  by 

v.  *  H.x  +  w.  for  i*  1,2 

•  1  1  1 

where  H.  is  r.vn  real  constant  matrix,  and  w  . —  N(0,A.).  The  leader  detects 
11  i  1 

the  action  of  the  follower  through  observing  v  ,  which  can  be  taken  without 
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loss  of  generality  as 

y0  -  (i  o)u2 

where  I  is  r*r-identity  matrix  with  r<m^.  Let  denote  the  information 
available  to  P^,  and  we  are  going  to  assume  the  following  nested  information 
structure 

ni :  vyry2 

n2  :  y2 ‘ 

As  before,  is  a  Borel  measurable  mapping  from  the  information  space  to 
Che  decision  space-  of  P^.  The  space  of  all  such  mappings  is  the  strategy 
space  of  (r±) . 

One  natural  lower  bound  for  is  the  infimum  of  over  the 
.  Tt. 

product  set  r^xr2  •  Let  us  assume  that  this  infimum  is  actually 

achieved,  so  the  leader  wants  to  adopt  a  certain  strategy  by  which  he  can 
force  the  follower  to  play  in  such  a  way  so  as  to  globally  minimize  J^,  even 
though  the  follower  intends  to  minimize  his  cost  function.  To  investigate 
the  ability  of  the  leader  to  enforce  the  team  solution,  let  us  make  the 
following  assumptions. 

Assumption  A;  The  control  value  of  the  follower  can  be  exactly  and  completely 
detected  by  the  leader  through  his  information. 

Assumption  B:  There  exists  a  strategy  for  the  leader  by  which  he  can  increase 
the  follower's  cost,  if  the  latter  does  not  abide  by  the  team  solution. 
Clearly,  if  the  above  two  assumptions  are  satisfied  then  the  leader  can  force 
his  team  solution  (see  [47]  for  details). 


Ill 


In  this  section  we  are  dealing  with  the  case  when  assumption  A 
is  not  satisfied,  thus  J*  is  not  the  realizable  tight  lower  bound  for  This  problem 
can  be  dealt  with  by  defining  a  new  modified  team  problem  for  the  leader 
which  takes  into  account  the  rational  reaction  of  the  follower  on  the 
undetectable  action  space. 

The  procedure  to  solve  the  problem  when  the  complete  detectability 
condition  is  not  satisfied  is  as  follows. 

1.  Minimize  J2^Y22,Y21,Y1^  with  resPect  t0  Y22Sr22’  where 


y226R2  and  r2-r21ur22 


Since  J2  is  quadratic  in  y22»  under  strict  convexity  assumption,  we 
obtain  a  unique  optimal  strategy  y*2  which  is  given  as  the  unique  map  f, 


where 


Y22  "  f(Y21’E(Yl'72)*  E(xly2^‘ 


Substitute  for  y22  in  to  obtain  a  new  cost  functional 

E(yx I y,)  ,E(x|y2)) .  The  infimum  of  J'^  over  the  product  set  is  the 

new  lower  tight  bound  (J*) .  If  the  infimum  is  actually  achieved,  then 

the  leader  can  force  this  modified  team  solution  as  will  be  shown  in  the 

following. 


6.3.1.  Determination  of  the  New  Tight  Lower  Bound  of  J 
Minimizing  J?(y2,»Yt^*Y^)  with  respect  to  y?0,  we  find  that 


V22(y2)  *  *R22}  [R22}  y21  +  E(x<  y2)  *  R2^  E f Y ! (y  1  ’ y 2  3 }  •r2>'i 


(6.3) 


where 


[r(1)1 

R21 

rR(i) 

R22 

r(2)1 

R22 

R21 

r(2) 

L  22  J 

;  R22  “ 

r(2)’ 

LR22 

r(3) 

R22  J 

R24 

rRU> 

lR24 

r(2)1 

R24  J* 

(3) 

Since  J2  is  quadratic  in  y22»  anc*  uBder  the  assumption  that  is  positive- 

definite,  the  minimization  is  obtained  by  person-to-person  optimization. 

2.  Substituting  the  value  of  Y22(y2)»  which  *s  obtained  above,  in  J^,  we 
get 

j1(x,Y1,Y21,E(Y1|y2),E(x|y2))  -  E{ \  YiRnYi  +  ^12^21 

‘YlR12)R22)"1(R22)'Y21+R21)E(xiy2)+R24),E(Yl!y2))+YiRi3X 

+  I  Y»  r51)y  '  r(2)r(3)-1(r,(2)  +R(2)Efx,  \  +  R(2) ' £ ,  ,  n 

2  y21R14  y21  Y21R14  R22  CR22  Y21  R21  Etxly2)+R24  ECy1  1  y2^ J 

+  1  (r’(2)  +  r(2)£(  ,  \  +  r(2)  \(y  !v  n,R(3)"1R(3>R(3)“1fR,(2)Y 

+  2  \r22  y21  +  R21  E <‘3C ' R24  E^Yl'y2^  R22  14  R22  tR22  f21 

+  R^)E(x|y2)  +R^2)  ,E(Y1|y2))  + y21R15>X*(R222)y21 

+  R22)E(x|y2)  +R22)  E(Y1iy2))'}. 


The  new  natural  lower  bound  for  the  leader  (J^)  is  the  infimum  of 


over  the  product  set  r^xp^.i.e.,  J*  >  inf 


Yl€ri,Y21€r21 


Ji°n’Yii,E(Yi'y'7)) 


3efore  we  state  the  theorem  which  gives  the  optimal  solution  which 


achieves  the  tight  lower  bound  of  let  us  assume  the  following: 
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,  R(l)  .  _(2)_(3)-l  (3).(3)-l  ' ’(2)  (2)  (3)-1  (2)*  (2)  (3)"1  (2)' 

i.  Ru  +1^2  R22  R14  R22  R22  -R14  R22  R22  R22  R22  Ru  is 

positive  definite. 

(2)  (3)-l  (3)  (3)-l  (2)»  (2)'  (3)’l  .  (2)  (3)“1  .  DOsitive  semi_ 

2*  z4  R22  R14  R22  R24  R12  R22  R24  R24  R22  R12  1  positive  semi 

definite. 

3  XrR"1(R(2)R(3)"1R(3)R(3)_1R(2)  '-R(2)  V3)~V  ^..Or1  (2)  \  1  <  1 

J.  ^1-R11^R24  R22  R14  R22  R24  R12  R22  R24  R24  R22  R12  ^  1 

where  X(A)  is  as  defined  in  the  previous  section. 

Theorem  6.2:  If  assumptions  1,  2,  and  3  are  satisfied,  then  there  exists  a 

unique  optimal  team  pair  (y^»Y2^)  for  the  modified  objective  function  J^, 

where  yj  and  y^  can  be  expressed  respectively,  as 

T5  '  hou21  +  hlyl  +  h2y2 


Proof ;  is  quadratic  and  under  assumption  1,  it  is  strictly  convex  in  y?^, 
so  the  optimal  y0  can  be  obtained  by  person-by-person  optimization,  and  bv 
taking  the  gradient  of  J1  with  respect  to  y9  and  setting  it  equal  to  zero. 

X  ^  X 

The  optimal  y^  is 


Y 21  -  kE(Yl!y2)  +  KECx|y2) 


wnere 


f  (1)  (2)  (3)-1  (3).(3)-l  (2)  ’  (2)  (3)'1  (2) ' 

^R14  +^22  R22  RT4  R22  R22  14  K22  ^22 

(2)  (3)-l  (2)’  -1  (1)’  (2)  (3)"1  (2)-1  (2)  (3)-1  (2)' 

-tS2  ^2^  K14  '  ^  i4  ’  k22  K7?  K24 

f  (1)  (2)  (3)"1  (3)  (3)"1  (2)'  (2)  (3)"1  (2)’ 

*22  ^22  ‘^22  R22  i<+  R22  R22 

(2)  (3)"1  (3)’  -1  (2)  (3)-1  (2)  (1)  (2)  (3)-1D(2) 

^22  ^22  14  K22  ^21  ±5  ^22  x2  ‘^15  '  * 


To  find  the  min  with  respect  to  y^»  we  determine  the  first  Frechet  varia¬ 
tion  6J^  of  J^,  and  set  it  equal  to  zero.  After  some  lengthy  manipulation, 
in  which  we  use  some  properties  of  the  conditional  expectation,  we  get 

R11Y1+^2U21+  K2E(xly2}  +k3E(Y1|y2)  +  R^ECxjy^y^  -  0 


where 


7  .  R(l)  p(2)p(3)“J*t}(2) '  (2)  (3)--L.(2)*  .(2)  (3)’  (3)'  (3)-1_(2)’ 

2  *  *12  "R12  R22  R22  "R24  R22  R14  R24  R22  R14  R22  R22 

r  =  (2)  (3)~1  (2)  (2)  (3)'1  (3)  (3)-1  (2)  (2)  (3)-1r(2) 

2  "R12  R22  R21  24  R22  ^.4  R22  21  R24  R22  15 

=  (2)  (3)-l  (2)’  (2)  (3)’1  (2)’  (2)  (3)"1  (3)  (3)’1  (2)' 

k3  -R12  R22  R24  "R24  R22  R12  24  R22  R14  R22  R24  * 

2 

Assumption  2  guarantees  that  the  second  Frechet  variation  6  J  is  positive 
semi-definite.  The  remainder  of  the  proof  is  equivalent  to  the  one  given  in 
[49],  which  uses  assumption  3  and  the  Gaussian  distribution  properties  to 
show  the  result.  . 


6.3.2.  The  Enforcement  of  the  Modified  Team  Solution 


Let  the  leader  adopt  a  strategy  which  is  of  the  same  form  as  the 
one  described  in  the  previous  section,  i.e. 

Y1  =  Y1  +  A(u2r^2}  =hoU21  +  hlyl  +  h2y2  +  A^u2rSy2)  • 

Substituting  for  y  from  above  and  for  y0_  from  (6.3)  in  and  taking  the 
gradient  of  J0  with  respect  to  u,  and  set  it  equal  to  zero,  we  get  the 


following  as  the  optimal  response 
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i  ’i- "3- ■.-'■iv.'-1' ■ .'.■■.■.,yL-\r»  .*'..■  ttt  :u- ■"'•■. '.■•'.v'-y."-'  •  .' 


E[{(R^)-R^)R^)“1R^))x+(R^)-R^)R^)  [(h+A) ’(R*^ 

A2)-(3)-\'(2)  .(2)  (3)-l(2)*  .  +A„  .  ’  (1)  R(2)  (3)'l  ’(2) 

R24  R22  R22  R24  R22  R24  Uo  A);  +  CR24  R22  R22  R24 

-<VA)  ,R24)R22rlR24)  )<ho+A)lu21+  <ho+A)  ’(R23* 

'R21)R22>"lR21>E<!'llr2)"R24,R22>"lR24) 'hlHlE(x|),2) 

-R24>43>‘Mr(h2-AE)y2>+(R2i)-R24>R22rlR222)' 

■R24)r22)"1r24>  'hiaiEU|y2)-R24>R22rlR24) '  <h2-Ai')y2) 

+  (RL1>-R2fR22)‘lR22)’-R24)R223)'lR242)'<V«>'<hl“lE(x!y2> 

(2)’  (21  “1  (21 '  (21  (31*1 

+(h2-Ah)y2)  +  (ho+A)'R^;  R£}  (ho+A)u21+(ho+A)'(‘R24  R22  R21X 

+  2  R24)r22>"1(R24>  'hlHl+R21>: )E(*!y2) +I  '  (h2*AS)y2} 

+  y  (hQ+A)  'R^^R^^  R24),(hlHlE(x^y2)+(h2*Ai')y2) 

+  \  (ho+A),R2J)R22)”1R^)E(x!y2)}!y7]  -0. 

Let  u21 *  u2i = hy2  in  the  above  equation.  After  lengthy  but  straightforward 
manipulation  we  get  the  above  equation  equivalent  to  the  following  linear 
matrix  equation 


BiA  ’  B: 


(6.4) 


vhere 

3, 


-,r,  r,  ,  (2)  Or1,^'  ,  ,  (2)  (3)-l  (2)’  (2)  (3)'1  (2)' 

h  .3-n  aQR24  R22  *24  •  L  r4~n2R24  R22  R24  hlR24  R22  R24 


_  _  Ml-!  ('’I'  -  ('M  (3V 

-L  ’  F'  -h '  F7-h '  F  'h  +h'h’R  '  R2,  h  -h’h'F.-L’Fjh  +  h^R,;  ’  R,., 

11  2  3  o  o  24  22  24  o  o  3  a  o  _  24  22 


R»?}  'h  -L’H'hjF  -h!F  -L’HJh  rW^'V21  'h 
24  o  11323  1  1  24  22  24  o 


,  ,  r(D  R(2)„(3)-l  (2) 
F1  R21  R22  R22  R21 

_  ,  R<1)  R(2)_(3)-1_(2)' 

F3  R24  "R24  R22  R22 


F  .  R(1)_R(2)_(3)-1  (2)’ 

’  2  R22  R22  R22  R22 

F  -  r  -R<2)R(3)"V2)  R(2V3)_1R(2)’h  H 
*4  R23  R21  R22  R21  R24  R22  R24  hlHl 


L  -  SH^(H2ZH^  +  A2)_1. 

The  following  theorem  summarizes  the  results  we  obtained. 

Theorem  6.3:  If  there  exists  at  least  one  matrix  A  which  satisfies  the  linear 
matrix  equation  B^A=B2>  and  if  is  picked  as 


yl  ■  ft*  A(u21-y21) ’ 


then  the  modified  team  solution  is  achieved. 

The  following  example  illustrates  some  of  the  basic  ideas  presented  in 
this  section. 

Example:  Let  the  objective  functions  of  the  leader  and  of  the  follower  be  as 
follows 


r,  2  n 

ui+ui[1  °) 


Ji  (i  ui+ui11  01 


21  .  ,  .  1  .  . 
*  2V  -  2  U21  u22 


rii  i 

U01  UT'>}  X 


1  0  u 


0  1  j  Lu22  J 


21  22 


r  r°l  i  ri  °]  ru_.  ]  .  fu,. 

r2=E‘|[u21  U22!  X  +  2  [u21  U22]  !  '  j+UjX+UjJl  glj 

L  L1J  L°  lu22  _!  Lu22 
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For  simplicity,  let 


7  "  yl  "  y2  "  x 


,  and  yQ  -  u21 


where  x  is  a  scalar  random  variable  with  x~N(0,o), 


If  we  minimize  J 2  with 


respect  to  u22»  we  c^at 


*  y  +  8  E(YiIy)‘ 


If  we  substitute  for  u 22  in  J^,  and  find  the  infimum  of  with  respect  to 


and  ,  the  result  will  be 


y'l  ’  -u21-T’,-64  E(,flly> 


Y 2i  *  ECYj^Iv)  +  4y. 

Solving  the  above  three  equations ,  we  get  the  modified  team  solution  as 


V*  -  -0.9846u21  -  2.092y 


Yj1  -  0-961y 


y"2  *  0 . 6202y . 


It  can  be  easily  checked  that  if  the  leader  chooses  his  strategy  of  the  form 


Yx  -  -0.9S46u21  -  2.092y  +  aCu^-O .961v) , 


where  a  »  4.166,  then  he  can  enforce  the  above  modified  team  solution,  and  the 


value  of  his  objective  function  (J*)  will  be 


J*  =  0.11727c. 


For  this  example,  is  not  strictly  convex  in  the  decision  variables,  so  the 


global  minimum  of  J,  does  not  exist.  J"  is  the  tisht  lower  bound  for  J,  which 

1  1  L 


the  leader  can  achieve,  i.e.,  values  of  which  are  less  than  cannot  be 


induced , 


CHAPTER  7 


CONCLUSIONS 

In  this  thesis,  the  role  of  information  structure  in  some  Nash  and 
LF  games  is  considered.  We  show  that  by  preserving  the  information  structure 
of  the  full  order  singularly  perturbed  LF  games,  reduced  order  solutions  which 
are  equivalent,  in  the  limit  as  u  tends  to  zero,  to  the  full  order  ones  are 
obtained,  but  by  using  DDIS,  solutions  which  are  different  from  and  more 
desirable  than  the  NIS  solutions  are  obtained.  For  example,  in  LF  games,  by 
using  DDIS,  the  leader  can,  under  certain  conditions,  achieve  his  most 
desirable  solutions,  which  normally  he  cannot  achieve.  We  investigate  several 
classes  of  Nash  and  LF  games  with  DDIS. 

By  preserving  the  information  structure  of  the  full  order  problems, 
while  solving  the  reduced  order  ones,  we  designed,  in  Chapters  2  and  3,  two 
well-posed  methods  to  obtain  reduced  order  and  near  optimal  strategies  for 
both  linear  closed  loop  and  team  LF  games. 

In  Chapter  4,  a  class  of  two-person  Nash  games  with  DDIS  is  considered. 
Necessary  conditions  for  existence  of  a  Nash  equilibrium  solution  with  DDIS  is 
derived  for  a  single  stage  general  duopoly  model  of  a  market  structure.  The 
case  of  linear  market  demand  function  and  quadratic  cost  function  is  analyzed 
in  detail  and  it  is  shown  that  the  profit  of  the  firm  with  DDIS  is  more  than 
its  corresponding  profit  with  NIS,  but  the  profit  of  the  other  firm  is 
decreased  compared  to  its  profit  in  NIS.  We  extended  our  analysis  of  the 
concept  of  DDIS  to  multistage  dynamic  games.  A  two-stage  duooolv  game  with 
DDIS  is  examined  and  sufficient  conditions  for  the  existence  of  an  eauilibrium 
solution  of  discrete  linear  quadratic  Nash  games  with  DDIS  are  ziven. 


In  Chapter  5,  we  showed  the  significance  of  using  decision- 
dependent  strategies  by  the  leader  in  forcing  cooperation  of  the  Nash  followers 
in  a  LF  game.  We  gave  a  static  model  of  a  market  which  consists  of  two  firms 
and  the  government.  The  two  firms  behave  as  Nash  duopolists,  while  the 
government  behaves  as  a  coordinator.  If  the  government  adopts  a  certain  repre¬ 
sentation  for  its  strategy,  which  is  affine  in  the  output  rates  of  the  two  firms, 
then  it  can  always  force  the  two  firms  to  cooperate.  We  analyzed  in  detail 
the  case  of  a  linear  demand  relation  and  quadratic  cost  functions;  we  found 
explicit  solutions  for  the  optimal  strategies  of  the  firms  and  the 
government;  and  we  compared  the  voluntary  Pareto-optimal  situation  with  the 
enforced  Pareto-optimal  one. 

In  Chapter  6,  we  solved  two  stochastic  static  LF  team  problems,  where 
each  player  has  a  quadratic  cost  function  and  the  random  variables  are 
normally  distributed.  The  first  problem  is  a  3-person  stochastic  optimal 
coordination.  We  showed  that  under  a  certain  condition,  the  coordinator  by 
adopting  strategies  which  are  linear  in  the  decisions  of  the  Nash  followers 
can  enforce  cooperation.  The  second  problem  is  a  2-person  LF  team  game,  in 
which  the  leader  does  not  completely  detect  the  decision  variable  of  the 
follower.  If  the  complete  detectability  condition  is  not  satisfied,  then  the 
leader  cannot  enforce  his  global  optimal  solution.  We  defined  a  new  modified 
team  problem  in  which  we  took  into  account  the  optimal  response  of  the 
undetected  action  of  the  follower.  We  found  out  that  the  leader  can.  under 
a  certain  condition,  achieve  this  new  tight  lower  bound. 

In  the  area  of  information  structure  in  Nash  and  LF  strategies, 
there  are  several  avenues  which  have  yet  to  be  explored,  such  as 
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1.  The  sensitiviCy  of  Nash  and  LF  equilibrium  solutions  to  both  uncertainties 
and  changes  in  the  information  structure.  In  particular,  we  may  try  to 
develop  robust  strategies  (strategies  which  are  Insensitive  to  uncertain¬ 
ties  and  changes  in  the  information  structure) ,  which  the  leader  can  adopt 
to  achieve  his  team  solution. 

2.  The  study  of  information  structure  design,  i.e.  who  should  know  what.  We 
are  still  at  a  very  elementary  stage  and  several  difficulties  have  to  be 
overcome,  before  we  are  able  to  answer  the  above  question  in  a  unified  and 
systematic  manner,  such  as,  a)  A  deeper  understanding  of  the  subject  of 
dynamic  information  structure,  b)  more  investigation  on  the  matter  of 
incentives  and  decision  dependent  information  structure. 

3.  Investigation  of  the  generality  of  the  effectiveness  of  preserving  the 
information  structure  in  obtaining  near  optimal  reduced  order  strategies 
and  well-posedness.  For  example,  we  can  check  whether  preserving  the 
information  structure  of  the  full  order  problem,  while  solving  the  reduced 
order  ones,  leads  to  well-posed  solutions  for  some  classes  of  nonlinear 
game  problems. 


* 
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APPENDIX  A 

NECESSARY  CONDITIONS  FOR  THE  LEADER'S  MINIMIZATION  PROBLEM 

Applying  the  matrix  minimum  principle  [14]  to  the  leader  optimi¬ 
zation  problem  we  get  the  following  set  of  matrix  algebraic  equations: 

PlAil  “  ¥lAl  “  ‘  P1K12313TT1  ‘  P1K12^12  +PlA21rri  +P2K2TT2 

+p2k12s  ’  2  +  p2k12s23tt;  +  P2K2  J21  +  P2K2 1s22rTi +  WnKl  +  >  ' 

P2^21  (^22 ^  '  ‘  ^13^22  +lt13^23np  *  P2*21^22^  Q23^22f^12  "*"®13TTi ^ 

'iAllPl  ’  B11K11P1  "  B12L21P1  *  TTl313Ki2Pl  "  312K12P1  +TT1A21P1  + 

n2K2P2  +TT1322.K11P2  +TT1323K12P2  +  B21K11P2  +  B22K12P2 
(rTl323K13  +  322K13)A22A21P2  +  (TT1313+312)  *  ('A22)  Q22P2_  (312+r*1^13)  (^22  *  ^23 


(A22)A21P2  *  0 


P2A^+A0P2  +1  -0 


(A.l) 
(A  .2) 


’B12K11P1  "  7T3S13K12P1 +  n3A2lPl  "  B22K12P1  +  ^2^  +  R22L21P2 
+  I3S23K12P2  +  "3^22R11P2  +  R12L21P1  ’  ('R22L22  +  '3S23R13',A22A21P2  + 

(  '3B13  B22 ^A22 ^  ^22 P2  ”  ^®22  +  3B13^A2'’ ^  l^',3A2',A',lP'>  *  ^  (A- 3) 

"7T5P  1^12^13 ^A22 ^  +  5PLA21  ^A22  ^  "r  3P2R2  6  +  32R2P2A^ ]_  (A,2 )  1  t  ■•!?., ) 
~"5P2R12B23t'A22^  ~R22^2lP2A2l  ^22^  '  ~  5^2^21^22^  ’  R13B’’  3  ^  >  (A,  *)  'Q'  , 

^2A21  ^A22  ^  "5P2^22A22B1 3  ^2  ^  ~ ^  (A., 0  )  '  Q'  ^  ^  ^  (An  )  1 

■4-322(A,,2) 'Q13(A^.,)A,  ,  ?^A^i(A  •,})'  +•  Rr,  L , ,  i,A ,  |)  A  ,  ^  1  (A . !  i 1  - 

^12^*22  ”  B22R13^  P3  *  ^  (A.  4) 
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”^22*21Pl*21  +A22S13K12P1A21  +*2lP2lt2  <S12  "  *12*22S13)  ^22*  ' 
+A22A21P2K2<P12  'A12A22^13)  **22*21^22*22*13  '^21^2^22*13  ^225' 
-*22l£21P2KU?22-i221^21P2K12523  +*21P2*21 ' <*£>  ’K13*23 ' <*H> '  + 

S2lV21<^2)  (*22,^21P2*21(^22)  '?2 £&} +  <S22^21P2 

A21(A221)'K13S23+S22P3'^22)522KHP2i21422l523Ki2p2X2r*21PlS21(*221)' 


I 

21 


1^1^12  ^22^  "^*  (^22^  ^12”^12^22^13^  ^2^2^21"**  ^12*^12^2^^13^  ^2P2^ 
(^20 ) '  I^?,j^>2^2  1  ^22^  ” ^?2^ “*13 ^*>2  ^  9^2^21”^  2^1 1P,,^2  1  ^2 7 ^ ' 

'^23K12P2^21(^22)  '  +  +  (^22^13 (^22 } 


‘^21^2^21  +  ^13^22^  '^23^22^21P2^21^22  ^  +  ^23K13^22^21P2*^2l  ^22^ 
+P3^22~^'2lP2^11'*22  ^22^  ”^21^2^12^23^22^  *  ^  (A.. 5) 


A 


t  C  t*  fS  "  \ 


wnere 


-  A-l 

1  *  ^’A12  +  ^12^22  +  ^12^13^22 


2  3  "Sll'S12"i+^12^22(S12  +  S13“P 


3  *  ^"P12^22  ^22^13^22 


^  ^  "  1  ,  “  _ 

'/t  *  C  ”^1'1  3~°1'?  ™  t"1*'  **  1  ®  ^  "3  "*"  ^  1  -\>  I 


13  3' 


5  3  "^‘"21^12" '^11^12  ~  ^12  ~  ^2ltv12i“22  ‘^11^12  ”r"L1^12l'2I 

„  '  ~  L  , 

+  A21k13^a22322J' 
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A 


17 6  "  "S12(^22^  '  +^12^22S13^22^  »  B2  “  B12~^12^22B22 


So  in  order  for  Che  leader  Co  find  I^*  he  has  Co  solve  equacions  (2.3), 


(2.6),  (2.7),  (2.8),  and  (A.l)  through  (A.5) 


I 


> 
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APPENDIX  B.  DECOMPOSITION  OF  THE  FULL 


ORDER  LEADER-FOLLOWER.  TEAM  PROBLEM 


Equations  (3.6)  -  (3.11)  of  the  full  order  system  is  decomposed  and  the 
limit  as  u  tends  to  zero  is  taken  as  follows : 

Substituting  the  forms  of  K(t,p),0^  as  given  before  in  equation  (3.6)  and 
letting  y-K)+,  we  get 

K  (t,0)  +K1(t,0)A11(t)  +K2(t,0)A21(t)  +AjA(t)K1(t,0)  +  A^(t)K^(t,0) 

+  Qu(t)-K1(t,0)S1K1(t,0)-K2(t,0)S'K1(t,0)-K1(t(0)S2^(t,0) 


-K2(t,0)S3K£(t,0)  =  0 


(B.l) 


K1(t,0)A12(t)  +  K2(t,0)A2,(t)  +A’1(t)K3(t,0)  +Q12(t)-K1(t,0)S2K3(t,0) 


-K2<t,0)S3K3(t,0)  *  0 


(B.2) 


K3(t,0)A22(t)+A’2(t)K3(t,0)-K3(t,0)S3K3(t,0)+Q13(t)  =0.  (B.3) 

By  letting  u— 0+,  in  equation  (S)  we  have 

R22(t)[B12Pl(C’0) +B22P3(!:’0)]  =  ^2(t)[B12Kl(t,0) +  322K2(t-0)]  (B*4) 


R2J(t)B’,P4(t,0)  *  R"^ ( C) B^2K3 (t ,0) . 


(3.5) 


Equation  (3 . 10)  will  be  after  letting  U“0 

F  (c,0)  =  R21Ru(B{1K1(t>0)  +  B^1K^(tf0))-B*1P1(t,0)-B’1P3(t,0)  (B.6) 


::(t,0)  -  R21R^‘5'1K3(t.0)-3;iP4(t,0) . 


(B .  7) 


decomposing  equation  (3 . 11) ,  where  the  gains  of  the  controls  are  evaluated  at 


i 
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All"SllKl(t’0)"S12K2(t,0)'S21Kl(t,0)‘S22K2(t:,0)A12 

-S12K3(t,0)-S23K3(t,0) 

A2rSi2Kl(t’0)"S13K2(t’0)“S22Kl(t»0)‘S23K2(t’0) 

V 

(A22)“S13K3(t*0)"S23K3(t,0))/u 


(B.8) 


where 


-T  I2+yIS 


I^+pTS  +yS 


uT  *  A22T  -  uTA1;l  +  uTA12T  -  A21 

:-S  =  -u[A1;L-A12T(t,u)  ]S  -  S[A22+ uT(t,u)A12]  -  Ai: 

Substituting  for  x,z  in  terms  of  v^,  v?  in  (B.8),  we  get 

“  ^All”A12T^t,;j^vi 
U<r2  *  (A22  +  yT(t’y) 


(B.9) 

(B.10) 


Let  ■'^^(t,tQ,-;) ,  *'22(t,tQ,u)  be  the  state  transition  matrices  of  (B.9)  and 
(B.10)  respectively,  then  they  satisfy  the  following  equations 

■ll(C’to”')  =  ^A11-A12T^C,U^  ^ll^^o’"^  c0  ’  Co  *  ^  =  1  (3.11) 


*  (‘^22  +  ^  ^  A12^22  ^  ’  ~o  *  '  ">2  ^  ‘"o  ’  Lo  ’  ~  “  (B.12) 
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*11  - 

*n“ 

,0)  - 

S12K2(t,0)  - 

S21Kl(t 

A12  * 

^2  “  S13K3(t 

.0)  - 

S23K3(t,0) 

A21  * 

A21  "  S12K1(C 

.0)  - 

S^K1(t,0)  - 

S23K^(t 

A22  * 

A22“  S13K3(C 

,0)- 

S23K3(t,0) 

Sll- 

hihlKv 

S12 

■  \lRUB21' 

*13  ' 

S2i 

B12R12Bi2: 

S22 

"  B12R12B22; 

S23  " 

[23], 

we  have 

rV 

lnB2: 


lim  T(t,y)  *  A~2A21 
u-K3+ 

lim  S(t ,vi)  -  -A12A*J. 
u+0+ 


Substituting  for 


[;]-L  ^2]  (3.2),  and  then  letting  y-<0+,  we  have 

i_^ru  • 


/  (dsn1(t,s,0)-dsn2(t,s,0)A2^A21)4'11(s,t,0)  *  -R^J (B^Kj^Ct ,0)  + 

0  +B^1K’(t,0)]+R^1K3(t,0)A2^A21  (B.13) 

dsn2(t’s,0)4'22(s,t’0)  "  “RliB21K3(t’0)-  (B.  14) 

Co 

Using  the  transformation  y  *  L^1  in  (3.4) ,  and  letting  u~0+,  we  get 
Cf 

PL(t.O)  -  /  [(QJ1  +  AJ1P1(T,0)  +A’1?3(t,0)  +  nj(t,t,0)F1(T,0)) 

“  (^22  +  A21P4^T  ^]_(~’^’3)fT(^,0))A^  ?A,^  ]  ^1(T,t,0)dr  =  0  (B.15) 

Q22  +  A21?4(t’0)  +  i{(-.t,0)F1(-,0)  =  0 

Q22  +  A12?1(t’0)  +  A;2P3(T,0)  +  n;(T,tf0)F1(7,0) 

4-  X  ' 

4.3  A22‘  4  '  ‘ ’  '  '2 


Q„*A'  ?  (t,0)  +  nI(7,c,0)F,(r,0)  =  0 


Yt< 
0  Yt< 
Yt  < 


(3.16) 
(B. 17) 


(3.13) 


APPENDIX  C.  THE  SUFFICIENT  CONDITIONS  FOR 
THE  EXISTENCE  OF  THE  FAST  TEAM  LEADER -FOLIO WE R  GAME 


The  sufficient  conditions  for  the  existence  of  an  optimal  leader- 
follower  fast  strategy  pair  which  coincides  with  a  team  fast  strategy  pair 
are  as  follows : 

If  there  exists  a  function  n£(t,0)  with  n£(t,9)*0  for  8>t  and 
matrix  which  satisfy 


/  dgnf (t,s)$f (s,t)  - 

R22B22K2f  “  R12B22Klf 


t . 
t 


uK2f (t)  -  /  (Q71(t)  +  A,7(x)K7f(T)  +  n^(T,t)Ff (t) ) <pf (T,t)dr  -  0 


23 


22  y  2f 


uRlf +  KlfA22  +  A22Klf +  Q13"Klf(B21RllB21  +  B22R12B22)Klf  “  0 


(C.l) 

(C.2) 

(C.3) 

(C.4) 


Kif(tf) 


wnere 


Ff(C) 


R21RllB21Klf"B21K2f 


(A22"B21RllB21Klf "B22R22B22K2f ) ( C ’ to) 


(C.5) 
(C.  6) 


then 


3f(t  »t-)  ■  I 
too 


ulf  *  ^  dsnf(t’s)zf(s) 
u2f  “  ~Rl2322KlfZf ^ 


is  the  optimal  fast  leader-follower  strategy  pair.  Furthermore,  this  is  an 


optimal  team  solution. 
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APPENDIX  D.  THE  SUFFICIENT  CONDITIONS  FOR  THE 
EXISTENCE  OF  THE  HYBRID  SLOW  TEAM  LEADER-FOLLOWER  GAME 

The  sufficient  conditions  for  the  existence  of  an  optimal  leader-follower 
team  hybrid  slow  strategy  pair  are  as  follows.  If  there  exists  a  function 
n  (t,0)  with  n(t,0)-O  for  9>t  and  h.  xn,  matrix  P  which  satisfy 

3  S  XX 

t  , 

/  [dgng(t,s)-dgnf(t,s,0)A22(A21-B21Mlg-B22M2g)]4.s(s,t)  -  -M^  (D.I) 

o 

^22(T)'^23(T)Mls<T)  +  ®i2(T)Ps<T)"B22(T)<A22),nf(T,t’0)Fs(T)“^22M2s"0 


for  t<  t 


Pg(t)  -  f  [Q2l"^22M2s“^22Mls  +  AllPs(x)  +  (ns(t,t) 


■<A22A21),nf(T’t’0))Fs(T)^s(T’t)dT  *  0  Vt<T 


(D.2) 


(D.3) 


where 


Q’2(T)-R21Mls(T)  +  B;iP5(T)+Fs(T)-8jl(A^),nf(t>t,°)Fs(T)-^3M2s(T).0 


and 


then 


9s(t,to)  =  (All"BllMls'B12M2s)<(>s(,:’t:o) 


(t  »t  )  =*  I 
s  o  o 


c  .-1. 


(D.4) 

(D.5) 


-  /  (d  n  (t,s)-d  nf(t,s)A^A  )x  (s)-/  d  nf(t,s)A^B  u  (s) 


Is 


22  21  s 


22  21  Is 


-/  dgnf(t,s)A2^B22u2g 
o 


“:s  *  -M2sxs(t) 


constitute  an  optimal  leader- follower  strategy  pair  which  coincides  with  the 


team  solution. 
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APPENDIX  E.  PROOF  OF  LEMMA  3.4 


Define  Y  -L  where 
o  o 


I2+wT 


J  L  ■ 


I^+uTS  +yS 


and  T,S  satisfies 


pT  -  A22T  -  UTA11  +  uTA12T-A21 

US  -  -u[A11-A12T]S-S[A22+  uTA12]-A12 


then  i|»  (t,t  ,p)  satisfies 
o  o 


•  A. -  -A. -T ( t , u)  0 

v  (t,t  ,u)  -  •  *(t,t  ,u). 

°  °  0  A22^TAI2  °  ° 

u 


In  a  proof  similar  to  the  one  given  in  [22.  p.  161.  we  can  show  that  T(t,u) 
is  continuously  differentiable  and  bounded  for  all  t>tQ  and  Vu€  [0,_*) 
where  u*  is  small  positive  parameter,  and  it  satisfies 

T(t,jj)  -  A22A21  +  0(u). 

To  prove  the  lenana,  it  is  sufficient  to  prove  that  (t,t  ,u)  and 
S  01  o 

“2 

’{/nn(c,C  ,u)  —  are  bounded  in  the  limit  as  u  tends  to  zero,  where  , 

w*.  O  U  Ui  O 

and  'i)g2(t,to,y)  satisfy  the  following 

^01^t,Co,iJ)  *  ^A11~A12A22A21  +  ® (y> )  ^01  ’ Co 


->  ( t ,  t  ,  4 ) 
Ui  o 


A,2  +  0(y) 


}  i;02(t’t:o,ij) 


since  A^-A^A^A.,^  is  bounded,  so 


.  S-m. 
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